Repository: mahout
Updated Branches:
  refs/heads/master 800a9ed6d -> 6188fa578

MAHOUT-2007 Fix wikipedia xml dump url closes apache/mahout#339

Project: http://git-wip-us.apache.org/repos/asf/mahout/repo
Commit: http://git-wip-us.apache.org/repos/asf/mahout/commit/6188fa57
Tree: http://git-wip-us.apache.org/repos/asf/mahout/tree/6188fa57
Diff: http://git-wip-us.apache.org/repos/asf/mahout/diff/6188fa57

Branch: refs/heads/master
Commit: 6188fa578acdfb275119a4f85ba128fc0506b4f8
Parents: 800a9ed
Author: jackcgai <[email protected]>
Authored: Tue Nov 28 15:02:41 2017 -0600
Committer: Trevor a.k.a @rawkintrevo <[email protected]>
Committed: Tue Nov 28 15:02:41 2017 -0600

----------------------------------------------------------------------
 examples/bin/classify-wikipedia.sh | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/mahout/blob/6188fa57/examples/bin/classify-wikipedia.sh
----------------------------------------------------------------------
diff --git a/examples/bin/classify-wikipedia.sh b/examples/bin/classify-wikipedia.sh
index 6871b0c..41dc0c9 100755
--- a/examples/bin/classify-wikipedia.sh
+++ b/examples/bin/classify-wikipedia.sh
@@ -76,7 +76,7 @@ if [ "x$alg" != "xclean" ]; then
 ########## partial small 42.5M zipped
 # curl https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles1.xml-p000000010p000030302.bz2 -o ${WORK_DIR}/wikixml/enwiki-latest-pages-articles.xml.bz2
 ########## partial larger 256M zipped
- # curl https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles10.xml-p002336425p003046511.bz2 -o ${WORK_DIR}/wikixml/enwiki-latest-pages-articles.xml.bz2
+ curl https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles10.xml-p2336425p3046511.bz2 -o ${WORK_DIR}/wikixml/enwiki-latest-pages-articles.xml.bz2
 ######### full wikipedia dump: 10G zipped
 # curl https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 -o ${WORK_DIR}/wikixml/enwiki-latest-pages-articles.xml.bz2
 ########################################################
