Author: siren Date: Sat Aug 5 06:45:49 2006 New Revision: 428999 URL: http://svn.apache.org/viewvc?rev=428999&view=rev Log: NUTCH-340 fix two bugs in 0.8 tutorial contributed by Uroš Gruber
Modified: lucene/nutch/trunk/site/tutorial8.html lucene/nutch/trunk/site/tutorial8.pdf lucene/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial8.xml Modified: lucene/nutch/trunk/site/tutorial8.html URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/site/tutorial8.html?rev=428999&r1=428998&r2=428999&view=diff ============================================================================== --- lucene/nutch/trunk/site/tutorial8.html (original) +++ lucene/nutch/trunk/site/tutorial8.html Sat Aug 5 06:45:49 2006 @@ -524,9 +524,9 @@ <h3 class="h4">Whole-web: Indexing</h3> <p>Before indexing we first invert all of the links, so that we may index incoming anchor text with the pages.</p> -<pre class="code">bin/nutch invertlinks crawl/linkdb crawl/segments</pre> +<pre class="code">bin/nutch invertlinks crawl/linkdb crawl/segments/*</pre> <p>To index the segments we use the <span class="codefrag">index</span> command, as follows:</p> -<pre class="code">bin/nutch index indexes crawl/linkdb crawl/segments/*</pre> +<pre class="code">bin/nutch index crawl/indexes crawl/crawldb crawl/linkdb crawl/segments/*</pre> <p>Now we're ready to search!</p> <a name="N101D5"></a><a name="Searching"></a> <h3 class="h4">Searching</h3> Modified: lucene/nutch/trunk/site/tutorial8.pdf URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/site/tutorial8.pdf?rev=428999&r1=428998&r2=428999&view=diff ============================================================================== --- lucene/nutch/trunk/site/tutorial8.pdf (original) +++ lucene/nutch/trunk/site/tutorial8.pdf Sat Aug 5 06:45:49 2006 @@ -327,10 +327,10 @@ >> endobj 50 0 obj -<< /Length 1862 /Filter [ /ASCII85Decode /FlateDecode ] +<< /Length 1870 /Filter [ /ASCII85Decode /FlateDecode ] >> stream -Gau`TgMYb8&:N/30L,Y#7_Ia;GVRR#\7gI.Xjb9Vo?\2m8m4f>.BZ9@<[EMAIL PROTECTED]'u#p=c&[EMAIL PROTECTED](2S;dFc;/[EMAIL PROTECTED],p_:%1I4;[EMAIL PROTECTED](([!HdgA\U[(DbB]t)ciVbqH7-I:i-R%RpX1gr'n[s86%:B&*hWo53-58NuIM[@<\h[EIr7WeGri"F^ap<1`->W(2o4aAIZY_^,^%O&\^Uea*e="T^WU7hZ=1`QZJ'OsIbCs9*F]K9$rN\8L)_4hDpKr8sHBXs2g^k'SAE-.q^AjY]/`oE4`pCQKqV2!#<K\PSqR+p#O%:6L-M;1dXMT/blW'[EMAIL PROTECTED](>[EMAIL PROTECTED]@`+7bTY`]M*9"jh1#8sHB^;KF\KoE'-TbRlXsLp<B4X=EOHTVVQI'\3KeEhc/r*0Xd?i'N$R3-JMi7?cN_2M7MA0X`;_nq-1jjG>G1Qe"K2rZ,7OI#b'Oj?,ZR#KW;W1cg?ERjT)hpj-r>=e([EMAIL PROTECTED](ME?60Y7D2HkB9/MCHK2.fbQ&)GAbo.(W=H$]oloIr=/)YorEL3TgsB(80[,JF4Q!iLd8S%?C=c"me$.oY'm6B,hWg>R<+%;5S>,k?.[!qO`&bZUfL#Yif;p"otkG1A8g#\'I>?QM\=opYB(ti1e%>IE:RLm1s,0C2c_O>^/:[EMAIL PROTECTED]/<<?JWZfHoWo^ul1;0?/6gq]nYM,LOm.m/"mb$,YX'Vl<4XiPR:L(/L([EMAIL PROTECTED]>=P[ATbR66GF!0#U740I["-Ja5a?H-gI6BAJ[_!+cV!No6i)&STVBm#%,lg3=U$fRJ(<H-AfE:DLH43u7KFCHfIWXQhnV80H[:h3]38jI%e9BJmgUD(l5p5$L1mNa)]4P]=a('!^J6]u)9KjAd/>qZ2:<&`M,9\<&N)co=(@FlTlp+3`:CRDt!h%X,01#C<)E?^F! 2fP-JD%L(l5_/PFjb3r8gCs5QW!s?b--sp?,5:l;I71%LaM><MF3TPs=R@'ILQ5VW)3S",AEJ)AUWYOr6-X1NmcgI<V3?/(ibr"]OK2[R:Ok\c3hDh!8NkmN!p`(8Q`s"P"!/mo2^J2LcS"[EMAIL PROTECTED]:[EMAIL PROTECTED],ichIo8!`U9>,[EMAIL PROTECTED];YS=5j<a([EMAIL PROTECTED]:gcW,3S4R+gH_Ut!6+Z9[#(Xeu49'Pn?M"Y612uG+-K1/[DjZ?;o\1NK?_aNNHsm1;gLHTpeRha=5CNGh\`c`tO29k"[EMAIL PROTECTED]/]<)_9MBi>!+k2\?;)dE!JddaBe7J0Ec`fP;\>34_MXdt/ED'XY\C#I\lbu;^*EBunl&(D/6J`Kdui24:H]j'[EMAIL PROTECTED])GY4PIFp/q;[EMAIL PROTECTED]@1EU_F2o((alH<WYlbD7dgUUS(etbocqMpNkqo,J=M68jLdHK0T+jl]`*TLp_,[EMAIL PROTECTED]"&),UR\<qV+o"[EMAIL PROTECTED]';\9oomdZJ8]ERh/Yu.'FgZo#IjOptZi%@:Kit41/9AkM^QQ^$;X_Q&W\hr\)R7gK[f5&"KD4>FIZO.I%q-2:Oc;W%"TFo0e?4SD5E$?S1D/6lZpL:dldJLUt/_*'H!B1FSX9MkmVe?\9gfEYj:2f4BS0>)i6]"fG.`XK]8!YXs1iM?SjRetla4Z-KTiU4Dt+7nLWc+]0PbPjn/h]gtD1m<&8Yff\?;E`G%Z@&f[5R&!bl^)pIVjhaK3$\p6uDp>$3[<64bhGnLLJ::0]OBOI11dZ&k5dIP)76O-OXoAIq"\QW~> +Gau`TD/\/u%0#[%T`P>E#0g<FVO%;[cjX^;@S=2M=?ikhHFD/.&6\0fEngZ4If$'&+4$7C":I*jZ-AJWjH0rp;S&<UI[lAr*)UV\s6iF^:OFf$!r*J8qL'Nd,Vf_>>^OrIYG8-''klf`_VtGW#t3uMcR0mt+fi=E=Rj%Okm_"toZlcD.4,`j1k$Bj[(8"qN;VB_+9&4jBZ^U(od4JCpQs>S)e[oG[X.%@"FbLdj\f_P=/E>UZWunGEW%ni*\J7T\^jndf_(0NCp'm2IG]P9rS^I7*4R.9'$8.QJpmYV`EUQAb`([EMAIL PROTECTED],8KO!RDG9?Kb#jAd$J)7CA&?+!:bB%!%W14G6k%+_]9u$nNrs_CDnGuCXn4#Wdd*9`0df;D`n!qs4KDr6UuQ8ARS6poD/po`At.$u>?fF)<>eB<VZU]G';#_aeNHV<&J>'35a>/!!H3P3rtD%tW/WLt#:f1B?k?<5GbFbK!beLLnU:O9q1Cs*3FSumlIESe?F`Z>AiD'Y8h9?%fpt;l[%?"UKs0n5VIj&2%,JHk93i)Z+4NoGc1O#sYLbH$7g_E%H]B@<:,[EMAIL PROTECTED])?J;g]Q&-bji\GRYL^59b6.f13*"[EMAIL PROTECTED]'rCS`5dSBL:D+mgRP.W45c$)3=Z:\'6*t'G[<_/[EMAIL PROTECTED])2=mK^Y%A2CQ8S5pLA7?7)lho\,[EMAIL PROTECTED])iJ!"Xp0:Ai:(QA-b)r]PWTQf[:e)u`^p^O2k23>FbC:+YZ`"n^%(nc'e0%l:[EMAIL PROTECTED],sdTSt'[+IA#3'&0JHuU]AaGXr"5%6,+:Z]-AVPar3km!Dq`%B[YDsMW\en4H<#"5<77VAgF%I;Vd#^8S>`8O)dJ3nbt*cGSlN-[W/u=4.PCgB40)$N]Gm&KW':,HB:8"#Y6B(N]\U3c^T69;bY<6.Za>q"cX_CUW61WB0`_AFH$%8CFe;3XVEUNHO`7+Am5` Z8se<cIL;4h,"Wf0`]Je2*4qKnV&J$SR1/3a(kYYt^:\j.^tTSMAB3g!A"r#q+50.mDK]/teD-m*qt&N)9Wh<rE2bqHJdsrUFTU\:#b5h1;1TW*6JQ'gI_1'rCm$d_cZ!e`k[VX>3t;7KX%.e#!7PeJ%XEGd3onhJ%b&kS.1-J6Y)-CZ<'C5foimn,/6U7HnIP<7QWLku%=p7GHeoIUenqTj4Ro)Pbr3^=il%p2\<UQarHk''R7bM2IEu<[EMAIL PROTECTED](BB)0VlL.)"rHY(_eFGJ#/Oj.cr3WT7m6i*&JOE<1NL3!%h8?O$?!5k0P%3.VAOgZpi1?2)]ToOa6g%9NSUP1W,pC?ro),DBf%`Vl>`]]=/[EMAIL PROTECTED]'T4k_QXh_dQe3$`D-<!2n;][8!)*H#f44OMCrUKO#]\<@54(0MV":3ZfP2J0,i,[EMAIL PROTECTED];SD;sW//a64fR)eqc5p3FF>!(6LiRYL-dS>e7.bE6e/+;h[5)O71([EMAIL PROTECTED];CnORTA1_];EAHoe`t%K6/gKJ!;2.%:gX;u7.h=i>=d`BOUSpoGLN(a<Na3g3FFd35-2nIF#e,u\2$Ku)>TCZDV`+3hO"QfK1O\;h+B]R[bL.;[EMAIL PROTECTED](:*gnjK=QFqiM(GG(V+QsS#^)>\DpRui',Qkp?SPUo'7Tr]:"pA3-^?ta270V'"[EMAIL PROTECTED]"E%P4%6A2\pWF>B5%1b6k+]a?mI_nD,RRbBIY"C\a'+9/["!Kr+r1NB1.e<[EMAIL PROTECTED]>JX'CkCkOao>d8Zd(?M^D;mZ&'[EMAIL PROTECTED]>/oTFYt]73I7S$#=>7d49~> endstream endobj 51 0 obj @@ -586,35 +586,35 @@ xref 0 73 0000000000 65535 f -0000021815 00000 n -0000021915 00000 n -0000022007 00000 n +0000021823 00000 n +0000021923 00000 n +0000022015 00000 n 0000000015 00000 n 0000000071 00000 n 0000000935 00000 n 0000001055 00000 n 0000001150 00000 n -0000022152 00000 n +0000022160 00000 n 0000001284 00000 n -0000022215 00000 n +0000022223 00000 n 0000001421 00000 n -0000022281 00000 n +0000022289 00000 n 0000001558 00000 n -0000022347 00000 n +0000022355 00000 n 0000001695 00000 n -0000022413 00000 n +0000022421 00000 n 0000001832 00000 n -0000022478 00000 n +0000022486 00000 n 0000001969 00000 n -0000022544 00000 n +0000022552 00000 n 0000002105 00000 n -0000022610 00000 n +0000022618 00000 n 0000002242 00000 n -0000022674 00000 n +0000022682 00000 n 0000002379 00000 n -0000022740 00000 n +0000022748 00000 n 0000002516 00000 n -0000022805 00000 n +0000022813 00000 n 0000002653 00000 n 0000005431 00000 n 0000005554 00000 n @@ -636,28 +636,28 @@ 0000014215 00000 n 0000016140 00000 n 0000016248 00000 n -0000018203 00000 n -0000018326 00000 n -0000018353 00000 n -0000022871 00000 n -0000018528 00000 n -0000018691 00000 n -0000018886 00000 n -0000019133 00000 n -0000019371 00000 n -0000019631 00000 n -0000019869 00000 n -0000020082 00000 n -0000020432 00000 n -0000020659 00000 n -0000020886 00000 n -0000021042 00000 n -0000021155 00000 n -0000021265 00000 n -0000021376 00000 n -0000021484 00000 n -0000021590 00000 n -0000021706 00000 n +0000018211 00000 n +0000018334 00000 n +0000018361 00000 n +0000022879 00000 n +0000018536 00000 n +0000018699 00000 n +0000018894 00000 n +0000019141 00000 n +0000019379 00000 n +0000019639 00000 n +0000019877 00000 n +0000020090 00000 n +0000020440 00000 n +0000020667 00000 n +0000020894 00000 n +0000021050 00000 n +0000021163 00000 n +0000021273 00000 n +0000021384 00000 n +0000021492 00000 n +0000021598 00000 n +0000021714 00000 n trailer << /Size 73 @@ -665,5 +665,5 @@ /Info 4 0 R >> startxref -22922 +22930 %%EOF Modified: lucene/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial8.xml URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial8.xml?rev=428999&r1=428998&r2=428999&view=diff ============================================================================== --- lucene/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial8.xml (original) +++ lucene/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial8.xml Sat Aug 5 06:45:49 2006 @@ -350,11 +350,11 @@ <p>Before indexing we first invert all of the links, so that we may index incoming anchor text with the pages.</p> -<source>bin/nutch invertlinks crawl/linkdb crawl/segments</source> +<source>bin/nutch invertlinks crawl/linkdb crawl/segments/*</source> <p>To index the segments we use the <code>index</code> command, as follows:</p> -<source>bin/nutch index indexes crawl/linkdb crawl/segments/*</source> +<source>bin/nutch index crawl/indexes crawl/crawldb crawl/linkdb crawl/segments/*</source> <!-- <p>Then, before we can search a set of segments, we need to delete --> <!-- duplicate pages. This is done with:</p> --> ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-cvs mailing list Nutch-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-cvs