Author: lewismc
Date: Tue Jan 27 20:48:27 2015
New Revision: 937922
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r937921, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Tue Jan 27 20:51:49 2015
New Revision: 937925
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r937924, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Tue Jan 27 20:51:16 2015
New Revision: 1655158
URL: http://svn.apache.org/r1655158
Log:
CMS commit to nutch by lewismc
Modified:
nutch/cms_site/trunk/content/index.md
Modified: nutch/cms_site/trunk/content/index.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site
Author: lewismc
Date: Tue Jan 27 22:50:39 2015
New Revision: 1655184
URL: http://svn.apache.org/r1655184
Log:
Create initial directory for Docker images
Added:
nutch/branches/2.x/docker/
Author: lewismc
Date: Wed Jan 28 00:25:58 2015
New Revision: 1655198
URL: http://svn.apache.org/r1655198
Log:
NUTCH-1920 Upgrade Nutch to use Java 1.7
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/default.properties
Modified: nutch/branches/2.x/CHANGES.txt
URL:
http
Modified: nutch/trunk/src/java/org/apache/nutch/util/FSUtils.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/util/FSUtils.java?rev=1655526r1=1655525r2=1655526view=diff
==
---
Modified: nutch/trunk/src/java/org/apache/nutch/crawl/CrawlDbReader.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/crawl/CrawlDbReader.java?rev=1655526r1=1655525r2=1655526view=diff
==
---
Modified: nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java?rev=1655526r1=1655525r2=1655526view=diff
==
---
Author: lewismc
Date: Thu Jan 29 05:38:59 2015
New Revision: 1655526
URL: http://svn.apache.org/r1655526
Log:
UTCH-865 Format source code in unique style
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/java/org/apache/nutch/crawl/AbstractFetchSchedule.java
nutch/trunk/src/java/org
Modified:
nutch/trunk/src/test/org/apache/nutch/segment/TestSegmentMergerCrawlDatums.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/test/org/apache/nutch/segment/TestSegmentMergerCrawlDatums.java?rev=1655526r1=1655525r2=1655526view=diff
Modified:
nutch/trunk/src/plugin/index-more/src/java/org/apache/nutch/indexer/more/MoreIndexingFilter.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/plugin/index-more/src/java/org/apache/nutch/indexer/more/MoreIndexingFilter.java?rev=1655526r1=1655525r2=1655526view=diff
Modified:
nutch/trunk/src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMBuilder.java?rev=1655526r1=1655525r2=1655526view=diff
Modified:
nutch/trunk/src/java/org/apache/nutch/scoring/ScoringFilterException.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/scoring/ScoringFilterException.java?rev=1655526r1=1655525r2=1655526view=diff
Modified:
nutch/trunk/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/Client.java
URL:
http://svn.apache.org/viewvc/nutch/trunk/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/Client.java?rev=1655526r1=1655525r2=1655526view=diff
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/storage/ProtocolStatus.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/ProtocolStatus.java?rev=1650437r1=1650436r2=1650437view=diff
Author: lewismc
Date: Fri Jan 9 03:53:39 2015
New Revision: 1650437
URL: http://svn.apache.org/r1650437
Log:
NUTCH-1856 Document webpage.avsc and host.avsc
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/build.xml
nutch/branches/2.x/ivy/ivy.xml
nutch/branches/2.x/src
Modified: nutch/branches/2.x/src/java/org/apache/nutch/storage/WebPage.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/WebPage.java?rev=1650437r1=1650436r2=1650437view=diff
==
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/storage/WebTableCreator.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/WebTableCreator.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/plugin/parse-html/src/test/org/apache/nutch/parse/html/TestDOMContentUtils.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/plugin/parse-html/src/test/org/apache/nutch/parse/html/TestDOMContentUtils.java?rev=1650447r1=1650446r2=1650447view=diff
Author: lewismc
Date: Fri Jan 9 06:14:33 2015
New Revision: 1650446
URL: http://svn.apache.org/r1650446
Log:
NUTCH-1907 Incorrect output of Outlinks to Hosts within HostDbUpdateReducer
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch/host
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/storage/ProtocolStatus.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/ProtocolStatus.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherReducer.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherReducer.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/plugin/protocol-httpclient/src/test/org/apache/nutch/protocol/httpclient/TestProtocolHttpClient.java
URL:
Modified:
nutch/branches/2.x/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/Client.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/Client.java?rev=1650447r1=1650446r2=1650447view=diff
Modified: nutch/branches/2.x/src/java/org/apache/nutch/storage/ParseStatus.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/ParseStatus.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java?rev=1650447r1=1650446r2=1650447view=diff
Modified: nutch/branches/2.x/src/java/org/apache/nutch/util/Bytes.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/util/Bytes.java?rev=1650447r1=1650446r2=1650447view=diff
==
---
Modified:
nutch/branches/2.x/src/plugin/parse-tika/src/test/org/apache/nutch/parse/tika/DOMContentUtilsTest.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/plugin/parse-tika/src/test/org/apache/nutch/parse/tika/DOMContentUtilsTest.java?rev=1650447r1=1650446r2=1650447view=diff
Modified: nutch/branches/2.x/src/java/org/apache/nutch/storage/WebPage.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/storage/WebPage.java?rev=1650447r1=1650446r2=1650447view=diff
==
Modified: nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java?rev=1650447r1=1650446r2=1650447view=diff
Author: lewismc
Date: Fri Jan 9 06:34:33 2015
New Revision: 1650447
URL: http://svn.apache.org/r1650447
Log:
NUTCH-1779 Apply formatting to the code
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch/api/NutchServer.java
nutch/branches/2.x/src
Modified: nutch/branches/2.x/src/java/org/apache/nutch/scoring/ScoreDatum.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/scoring/ScoreDatum.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/plugin/urlfilter-suffix/src/test/org/apache/nutch/urlfilter/suffix/TestSuffixURLFilter.java
URL:
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/indexer/solr/SolrUtils.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/indexer/solr/SolrUtils.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/plugin/lib-http/src/java/org/apache/nutch/protocol/http/api/HttpBase.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/plugin/lib-http/src/java/org/apache/nutch/protocol/http/api/HttpBase.java?rev=1650447r1=1650446r2=1650447view=diff
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/parse/ParseStatusCodes.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/parse/ParseStatusCodes.java?rev=1650447r1=1650446r2=1650447view=diff
Modified: nutch/branches/2.x/src/test/org/apache/nutch/fetcher/TestFetcher.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/test/org/apache/nutch/fetcher/TestFetcher.java?rev=1650447r1=1650446r2=1650447view=diff
Author: lewismc
Date: Fri Jan 9 08:36:52 2015
New Revision: 1650460
URL: http://svn.apache.org/r1650460
Log:
Nutch 2.3 release candidate
Added:
nutch/tags/release-2.3/
- copied from r1650459, nutch/branches/branch-2.3/
Author: lewismc
Date: Fri Jan 9 08:20:47 2015
New Revision: 1650458
URL: http://svn.apache.org/r1650458
Log:
Nutch 2.3 branch
Added:
nutch/branches/branch-2.3/
- copied from r1650457, nutch/branches/2.x/
Author: lewismc
Date: Fri Feb 13 22:05:47 2015
New Revision: 1659697
URL: http://svn.apache.org/r1659697
Log:
NUTCH-827 HTTP POST Authentication
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/conf/httpclient-auth.xml.template
nutch/trunk/src/plugin/protocol-httpclient/ivy.xml
Author: lewismc
Date: Fri Feb 13 22:20:15 2015
New Revision: 1659701
URL: http://svn.apache.org/r1659701
Log:
NUTCH-827 HTTP POST Authentication
Added:
nutch/trunk/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpFormAuthConfigurer.java
nutch/trunk/src
Author: lewismc
Date: Thu Feb 5 18:52:32 2015
New Revision: 1657662
URL: http://svn.apache.org/r1657662
Log:
Update 2.3 JavaDoc link
Modified:
nutch/cms_site/trunk/content/javadoc.md
Modified: nutch/cms_site/trunk/content/javadoc.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site/trunk
Author: lewismc
Date: Fri Mar 13 18:01:43 2015
New Revision: 1666532
URL: http://svn.apache.org/r1666532
Log:
TRIVIAL add license headers to conf files
Modified:
nutch/trunk/conf/log4j.properties
nutch/trunk/conf/mimetype-filter.txt
Modified: nutch/trunk/conf/log4j.properties
URL:
http
Author: lewismc
Date: Wed Mar 4 18:48:32 2015
New Revision: 1664109
URL: http://svn.apache.org/r1664109
Log:
NUTCH-1949 Dump out the Nutch data into the Common Crawl format
Added:
nutch/trunk/src/java/org/apache/nutch/tools/AbstractCommonCrawlFormat.java
nutch/trunk/src/java/org/apache
Author: lewismc
Date: Sun Feb 22 19:54:21 2015
New Revision: 1661539
URL: http://svn.apache.org/r1661539
Log:
NUTCH-1925 Upgrade Tika to version 1.7
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/TikaConfig.java
Author: lewismc
Date: Thu Feb 26 20:33:10 2015
New Revision: 1662559
URL: http://svn.apache.org/r1662559
Log:
NJTCH-1933 remove unnecessary target directory
Removed:
nutch/trunk/src/plugin/protocol-selenium/src/target/
Author: lewismc
Date: Thu Apr 23 21:34:18 2015
New Revision: 1675723
URL: http://svn.apache.org/r1675723
Log:
NUTCH-1994 Upgrade to Apache Tika 1.8
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/ivy/ivy.xml
nutch/trunk/src/plugin/parse-tika/ivy.xml
nutch/trunk/src/plugin/parse
Author: lewismc
Date: Thu Apr 23 22:00:59 2015
New Revision: 948956
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r948955, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Thu Apr 23 22:00:49 2015
New Revision: 1675726
URL: http://svn.apache.org/r1675726
Log:
Add announcement for 2000th issue
Modified:
nutch/cms_site/trunk/content/index.md
Modified: nutch/cms_site/trunk/content/index.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site
Author: lewismc
Date: Thu Apr 23 21:38:36 2015
New Revision: 1675724
URL: http://svn.apache.org/r1675724
Log:
NUTCH-1994 Upgrade to Apache Tika 1.8
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/ivy/ivy.xml
nutch/branches/2.x/src/plugin/parse-tika/howto_upgrade_tika.txt
Author: lewismc
Date: Thu Apr 23 23:55:09 2015
New Revision: 1675735
URL: http://svn.apache.org/r1675735
Log:
Add back in NUTCH-1927 property to nutch-default as revoved during commit
@1675022
Modified:
nutch/trunk/conf/nutch-default.xml
Modified: nutch/trunk/conf/nutch-default.xml
URL
Author: lewismc
Date: Tue Apr 21 15:22:19 2015
New Revision: 1675136
URL: http://svn.apache.org/r1675136
Log:
Add additionaly instructions for pull requests to accomodate coding convention
Modified:
nutch/trunk/README.md
Modified: nutch/trunk/README.md
URL:
http://svn.apache.org/viewvc
Author: lewismc
Date: Wed Apr 22 16:35:25 2015
New Revision: 1675408
URL: http://svn.apache.org/r1675408
Log:
NUTCH-1996 Make protocol-selenium README part of plugin
Added:
nutch/trunk/src/plugin/protocol-selenium/README.md
Modified:
nutch/trunk/CHANGES.txt
Modified: nutch/trunk
Propchange: dev/nutch/1.10/CHANGES.txt
--
svn:executable = *
Added: dev/nutch/1.10/KEYS
==
--- dev/nutch/1.10/KEYS (added)
+++
Author: lewismc
Date: Wed Apr 29 21:52:14 2015
New Revision: 8754
Log:
Add staging artifacts for Apache Nutch 1.10 RC#1
Added:
dev/nutch/1.10/
dev/nutch/1.10/CHANGES.txt (with props)
dev/nutch/1.10/KEYS (with props)
dev/nutch/1.10/apache-nutch-1.10-bin.tar.gz (with props
(Giuseppe Totaro, Luke Sh via mattmann)
+
+* NUTCH-1991 Tika mime detection not using Nutch supplied tika-mimetypes.xml
for content based
+ detection (Iain Lopata, snagel via mattmann)
+
+* NUTCH-1994 Upgrade to Apache Tika 1.8 (lewismc)
+
+* NUTCH-1996 Make protocol-selenium README part
Author: lewismc
Date: Wed Apr 29 21:28:02 2015
New Revision: 1676862
URL: http://svn.apache.org/r1676862
Log:
Nutch 1.10 branch
Added:
nutch/branches/branch-1.10/
- copied from r1676861, nutch/trunk/
Author: lewismc
Date: Wed Apr 29 21:32:17 2015
New Revision: 1676865
URL: http://svn.apache.org/r1676865
Log:
Push updates for 1.10 release
Modified:
nutch/branches/branch-1.10/CHANGES.txt
nutch/branches/branch-1.10/NOTICE.txt
nutch/branches/branch-1.10/conf/nutch-default.xml
Author: lewismc
Date: Fri May 8 04:25:05 2015
New Revision: 1678281
URL: http://svn.apache.org/r1678281
Log:
NUTCH-1934 Refactor Fetcher in trunk
Added:
nutch/trunk/src/java/org/apache/nutch/fetcher/FetchItem.java
nutch/trunk/src/java/org/apache/nutch/fetcher/FetchItemQueue.java
Author: lewismc
Date: Wed May 6 23:32:39 2015
New Revision: 1678111
URL: http://svn.apache.org/r1678111
Log:
NUTCH-2004 ParseChecker does not handle redirects
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/java/org/apache/nutch/parse/ParserChecker.java
nutch/trunk/src/java/org
Author: lewismc
Date: Wed May 6 20:43:54 2015
New Revision: 950341
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r950340, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 6 20:44:47 2015
New Revision: 1678092
URL: http://svn.apache.org/r1678092
Log:
CMS commit to nutch by lewismc
Modified:
nutch/cms_site/trunk/content/javadoc.md
Modified: nutch/cms_site/trunk/content/javadoc.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site
Author: lewismc
Date: Wed May 6 20:44:58 2015
New Revision: 950342
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r950341, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 6 19:30:36 2015
New Revision: 950331
Log:
Update download links
Added:
websites/production/nutch/content/
- copied from r950330, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 6 19:14:59 2015
New Revision: 8848
Log:
Remove previous 1.9 Nutch release
Removed:
release/nutch/1.9/
Author: lewismc
Date: Wed May 6 19:06:56 2015
New Revision: 8847
Log:
Release Apache Nutch 1.10
Added:
release/nutch/1.10/
- copied from r8846, dev/nutch/1.10/
Removed:
dev/nutch/1.10/
Author: lewismc
Date: Wed May 6 19:20:32 2015
New Revision: 950323
Log:
Add new links to Javadocs
Added:
websites/production/nutch/content/
- copied from r950322, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 6 19:20:18 2015
New Revision: 1678067
URL: http://svn.apache.org/r1678067
Log:
Add new links to Javadocs
Modified:
nutch/cms_site/trunk/content/javadoc.md
Modified: nutch/cms_site/trunk/content/javadoc.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site/trunk
Author: lewismc
Date: Wed May 6 19:27:33 2015
New Revision: 950329
Log:
Update to accomodate new release announcemence
Added:
websites/production/nutch/content/
- copied from r950328, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 6 19:27:23 2015
New Revision: 1678069
URL: http://svn.apache.org/r1678069
Log:
Update to accomodate new release announcemence
Modified:
nutch/cms_site/trunk/content/index.md
Modified: nutch/cms_site/trunk/content/index.md
URL:
http://svn.apache.org/viewvc
Author: lewismc
Date: Wed May 6 19:30:28 2015
New Revision: 1678072
URL: http://svn.apache.org/r1678072
Log:
Update download links
Modified:
nutch/cms_site/trunk/content/downloads.md
Modified: nutch/cms_site/trunk/content/downloads.md
URL:
http://svn.apache.org/viewvc/nutch/cms_site/trunk
Author: lewismc
Date: Fri May 8 23:29:45 2015
New Revision: 1678459
URL: http://svn.apache.org/r1678459
Log:
NUTCH-1873 Solr IndexWriter/Job to report number of docs indexed.
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/java/org/apache/nutch/indexer/IndexerMapReduce.java
nutch
Author: lewismc
Date: Tue Jun 23 22:32:03 2015
New Revision: 1687145
URL: http://svn.apache.org/r1687145
Log:
NUTCH-2045 index-basic incorrect assignment of next fetch time
(page.getFetchTime()) as page fetch time
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/plugin
Author: lewismc
Date: Thu May 28 19:35:26 2015
New Revision: 953022
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r953021, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed May 27 23:28:26 2015
New Revision: 1682136
URL: http://svn.apache.org/r1682136
Log:
NUTCH-208 http: proxy exception list:
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/conf/nutch-default.xml
nutch/trunk/src/plugin/lib-http/src/java/org/apache/nutch/protocol
Added:
nutch/branches/2.x/docker/cassandra/nutch/plugin/nutch2-index-html/src/plugin/index-html/src/java/org/apache/nutch/indexer/html/HtmlIndexingFilter.java
URL:
Author: lewismc
Date: Thu May 21 17:14:24 2015
New Revision: 1680929
URL: http://svn.apache.org/r1680929
Log:
NUTCH-1923 Nutch + Cassandra Docker
Added:
nutch/branches/2.x/docker/cassandra/
nutch/branches/2.x/docker/cassandra/LICENSE
nutch/branches/2.x/docker/cassandra/README.md
Author: lewismc
Date: Thu May 21 17:15:11 2015
New Revision: 1680930
URL: http://svn.apache.org/r1680930
Log:
update CHANGES.txt
Modified:
nutch/branches/2.x/CHANGES.txt
Modified: nutch/branches/2.x/CHANGES.txt
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/CHANGES.txt?rev=1680930r1
Author: lewismc
Date: Thu May 21 17:24:36 2015
New Revision: 1680932
URL: http://svn.apache.org/r1680932
Log:
NUTCH-1923 Nutch + Cassandra Docker remove index-html plugin as it already
exists
Removed:
nutch/branches/2.x/docker/cassandra/nutch/plugin/
Author: lewismc
Date: Tue May 26 15:41:57 2015
New Revision: 1681781
URL: http://svn.apache.org/r1681781
Log:
NUTCH-2019 ClassPathException sending topN argument for /job/create using Nutch
2.x RESTApi
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch
Author: lewismc
Date: Tue Aug 18 21:19:07 2015
New Revision: 1696506
URL: http://svn.apache.org/r1696506
Log:
NUTCH-1486 Upgrade to Solr 4.10.2
Added:
nutch/trunk/src/plugin/index-geoip/build-ivy.xml
- copied, changed from r1693938,
nutch/trunk/src/plugin/parse-tika/build-ivy.xml
Author: lewismc
Date: Thu Jul 30 21:29:42 2015
New Revision: 1693507
URL: http://svn.apache.org/r1693507
Log:
NUTCH-1785 Ability to index raw content
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/conf/schema-solr4.xml
nutch/trunk/conf/schema.xml
nutch/trunk/ivy/ivy.xml
nutch
Author: lewismc
Date: Wed Jul 22 04:08:20 2015
New Revision: 1692216
URL: http://svn.apache.org/r1692216
Log:
NUTCH-2021 Use protocol-selenium to Capture Screenshots of the Page as it is
Fetched
Added:
nutch/trunk/src/plugin/lib-selenium/build-ivy.xml
- copied, changed from r1687398
Author: lewismc
Date: Wed Jul 22 12:51:05 2015
New Revision: 1692268
URL: http://svn.apache.org/r1692268
Log:
NUTCH-2063 Add -mimeStats flag to FileDumper tool
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/java/org/apache/nutch/tools/FileDumper.java
Modified: nutch/trunk/CHANGES.txt
Author: lewismc
Date: Thu Oct 29 20:52:28 2015
New Revision: 1711359
URL: http://svn.apache.org/viewvc?rev=1711359=rev
Log:
NUTCH-1800 Documentation for Nutch 1.X and 2.X REST APIs
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/build.xml
nutch/trunk/ivy/ivy.xml
nutch/trunk/ivy
Author: lewismc
Date: Thu Oct 22 03:47:04 2015
New Revision: 1709943
URL: http://svn.apache.org/viewvc?rev=1709943=rev
Log:
NUTCH-2148 Review and update mapred --> mapreduce config params in crawl script
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/bin/crawl
Modified: nutch/tr
Author: lewismc
Date: Thu Nov 12 15:19:25 2015
New Revision: 1714068
URL: http://svn.apache.org/viewvc?rev=1714068=rev
Log:
NUTCH-2120 Remove MapWritable from trunk codebase
Removed:
nutch/trunk/src/java/org/apache/nutch/crawl/MapWritable.java
Modified:
nutch/trunk/CHANGES.txt
Modified
Author: lewismc
Date: Fri Oct 30 05:05:54 2015
New Revision: 10964
Log:
Dropping Apache Nutch 2.3.1 RC#1
Removed:
dev/nutch/2.3.1/
Author: lewismc
Date: Thu Nov 5 03:08:04 2015
New Revision: 1712705
URL: http://svn.apache.org/viewvc?rev=1712705=rev
Log:
NUTCH-2159 Ensure that all WebApp files are copied into generated artifacts for
1.X Webapp
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/build.xml
nutch/trunk
Author: lewismc
Date: Mon Nov 2 21:57:50 2015
New Revision: 1712170
URL: http://svn.apache.org/viewvc?rev=1712170=rev
Log:
Add new License Key for Miredot and fix formatting for mvn.template
Modified:
nutch/trunk/ivy/mvn.template
Modified: nutch/trunk/ivy/mvn.template
URL:
http
Author: lewismc
Date: Wed Aug 26 02:21:31 2015
New Revision: 1697808
URL: http://svn.apache.org/r1697808
Log:
NUTCH-2083 Implement functionality to shadow nutch-selenium-grid-plugin from Mo
Omer
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/conf/nutch-default.xml
nutch/trunk/src
Author: lewismc
Date: Mon Aug 31 20:51:02 2015
New Revision: 963736
Log:
Update to lua script
Added:
websites/production/nutch/content/
- copied from r963735, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Wed Sep 16 04:22:47 2015
New Revision: 1703331
URL: http://svn.apache.org/r1703331
Log:
NUTCH-1679 UpdateDb using batchId, link may override crawled page
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
Author: lewismc
Date: Mon Sep 28 18:58:33 2015
New Revision: 1705744
URL: http://svn.apache.org/viewvc?rev=1705744=rev
Log:
NUTCH-2086 Nutch 1.X Webui this closes #61
Added:
nutch/trunk/src/java/org/apache/nutch/webui/
nutch/trunk/src/java/org/apache/nutch/webui/NutchUiApplication.java
Author: lewismc
Date: Sat Oct 3 00:16:04 2015
New Revision: 967561
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r967560, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Sat Oct 3 00:16:40 2015
New Revision: 967563
Log:
Publishing svnmucc operation to nutch site by lewismc
Added:
websites/production/nutch/content/
- copied from r967561, websites/staging/nutch/trunk/content/
Author: lewismc
Date: Sat Oct 3 00:15:40 2015
New Revision: 1706510
URL: http://svn.apache.org/viewvc?rev=1706510=rev
Log:
CMS commit to nutch by lewismc
Modified:
nutch/cms_site/trunk/content/javadoc.md
Modified: nutch/cms_site/trunk/content/javadoc.md
URL:
http://svn.apache.org/viewvc
Author: lewismc
Date: Sun Sep 20 12:50:51 2015
New Revision: 1704128
URL: http://svn.apache.org/viewvc?rev=1704128=rev
Log:
NUTCH-1946 Upgrade to Gora 0.6.1
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/conf/nutch-default.xml
nutch/branches/2.x/ivy/ivy.xml
nutch
Author: lewismc
Date: Sun Sep 20 12:54:49 2015
New Revision: 1704129
URL: http://svn.apache.org/viewvc?rev=1704129=rev
Log:
NUTCH-2050 Upgrade HBase and Hadoop versioning on 2.X HBase Docker
Modified:
nutch/branches/2.x/docker/hbase/Dockerfile
nutch/branches/2.x/docker/hbase/README.md
Author: lewismc
Date: Wed Sep 23 16:55:38 2015
New Revision: 1704896
URL: http://svn.apache.org/viewvc?rev=1704896=rev
Log:
NUTCH-2111 Delete temporary files location for selenium tmp files after driver
quits
Modified:
nutch/trunk/CHANGES.txt
nutch/trunk/src/plugin/lib-selenium/src
301 - 400 of 696 matches
Mail list logo