Author: cutting
Date: Tue Apr 19 11:58:12 2005
New Revision: 161952
URL: http://svn.apache.org/viewcvs?view=rev&rev=161952
Log:
Deprecate link analysis. Remove it from the tutorial and change the default
configuration so that link counts are used instead.
Modified:
incubator/nutch/trunk/CHANGES.txt
incubator/nutch/trunk/conf/nutch-default.xml
incubator/nutch/trunk/site/tutorial.html
incubator/nutch/trunk/site/tutorial.pdf
incubator/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial.xml
Modified: incubator/nutch/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewcvs/incubator/nutch/trunk/CHANGES.txt?view=diff&r1=161951&r2=161952
==============================================================================
--- incubator/nutch/trunk/CHANGES.txt (original)
+++ incubator/nutch/trunk/CHANGES.txt Tue Apr 19 11:58:12 2005
@@ -63,6 +63,14 @@
12. Close Issue #33 - MIME content type detector (using magic char sequences).
(Jerome Charron and Hari Kodungallur via John Xing, 20050416)
+13. Add a servlet that implements A9's OpenSearch RSS web service.
+ (cutting, 20050418)
+
+14. Remove references to link analysis from tutorial, and enable
+ scoring by link count when generating fetchlists and searching.
+ (cutting, 20040419)
+
+
Release 0.6
1. Added clustering-carrot2 plugin, together with introduction of clustering
Modified: incubator/nutch/trunk/conf/nutch-default.xml
URL:
http://svn.apache.org/viewcvs/incubator/nutch/trunk/conf/nutch-default.xml?view=diff&r1=161951&r2=161952
==============================================================================
--- incubator/nutch/trunk/conf/nutch-default.xml (original)
+++ incubator/nutch/trunk/conf/nutch-default.xml Tue Apr 19 11:58:12 2005
@@ -257,7 +257,7 @@
<property>
<name>fetchlist.score.by.link.count</name>
- <value>false</value>
+ <value>true</value>
<description>If true, set page scores on fetchlist entries based on
log(number of anchors), instead of using original page scores. This
results in prioritization of pages with many incoming links.
@@ -382,7 +382,7 @@
<property>
<name>indexer.boost.by.link.count</name>
- <value>false</value>
+ <value>true</value>
<description>When true scores for a page are multipled by the log of
the number of incoming links to the page.</description>
</property>
Modified: incubator/nutch/trunk/site/tutorial.html
URL:
http://svn.apache.org/viewcvs/incubator/nutch/trunk/site/tutorial.html?view=diff&r1=161951&r2=161952
==============================================================================
--- incubator/nutch/trunk/site/tutorial.html (original)
+++ incubator/nutch/trunk/site/tutorial.html Tue Apr 19 11:58:12 2005
@@ -396,10 +396,6 @@
<pre class="code">bin/nutch updatedb db $s1</pre>
<p>Now the database has entries for all of the pages referenced by the
initial set.</p>
-<p>Next we run five iterations of link analysis on the database in order
-to prioritize which pages to next fetch:</p>
-<pre class="code">bin/nutch analyze db 5
-</pre>
<p>Now we fetch a new segment with the top-scoring 1000 pages:</p>
<pre class="code">bin/nutch generate db segments -topN 1000
s2=`ls -d segments/2* | tail -1`
@@ -407,7 +403,6 @@
bin/nutch fetch $s2
bin/nutch updatedb db $s2
-bin/nutch analyze db 2
</pre>
<p>Let's fetch one more round:</p>
<pre class="code">
@@ -417,10 +412,10 @@
bin/nutch fetch $s3
bin/nutch updatedb db $s3
-bin/nutch analyze db 2</pre>
+</pre>
<p>By this point we've fetched a few thousand pages. Let's index
them!</p>
-<a name="N1018D"></a><a name="Whole-web%3A+Indexing"></a>
+<a name="N10186"></a><a name="Whole-web%3A+Indexing"></a>
<h3 class="h4">Whole-web: Indexing</h3>
<p>To index each segment we use the <span class="codefrag">index</span>
command, as follows:</p>
@@ -431,7 +426,7 @@
duplicate pages. This is done with:</p>
<pre class="code">bin/nutch dedup segments dedup.tmp</pre>
<p>Now we're ready to search!</p>
-<a name="N101A8"></a><a name="Searching"></a>
+<a name="N101A1"></a><a name="Searching"></a>
<h3 class="h4">Searching</h3>
<p>To search you need to put the nutch war file into your servlet
container. (If instead of downloading a Nutch release you checked the
Modified: incubator/nutch/trunk/site/tutorial.pdf
URL:
http://svn.apache.org/viewcvs/incubator/nutch/trunk/site/tutorial.pdf?view=diff&r1=161951&r2=161952
==============================================================================
--- incubator/nutch/trunk/site/tutorial.pdf (original)
+++ incubator/nutch/trunk/site/tutorial.pdf Tue Apr 19 11:58:12 2005
@@ -265,10 +265,10 @@
>>
endobj
42 0 obj
-<< /Length 2131 /Filter [ /ASCII85Decode /FlateDecode ]
+<< /Length 2071 /Filter [ /ASCII85Decode /FlateDecode ]
>>
stream
-GatU4D3*F0%/ui*i=?$:[EMAIL
PROTECTED]@Ul!HhitdH]AeCgV@@rV4d#*E.f[QOO,$Q6T8f6h2DbQZnXcmNu9g/r;QaJ[=,[EMAIL
PROTECTED]"lo/2Lui&Ot%ZcTD7QZnYXD<0(+Qb15TKsT9Ki?*3N-UgQ-lJ/M45H`RE&uD3^f=p#;Eg=k?4+B;85OTWN+UQ-OJPn,P198Rcf^q(@.,3ToMXWU!OggUlM+9gafBRhXBpGUeH6[&]_n27\?<0*>$C*2u\([EMAIL
PROTECTED],MbCTZ6TER7;oA9!a?&)EYdcYqWsi:!kNT3KjmZNA1;WjBkIHmuKgcK4RG>Fl<cVJFO2Ea;PeZIi5ON_-$0$7sn+hcm^;]VsjWY+(^k/[EMAIL
PROTECTED],8+uYO5j4iDHc(kR.fr=p!+XdNpCHsr8r<#+-WKf+S6*6bEX`bJ6kDc+\N!0bGBA^C.`;6?_P[VZ"l^J(4YmR,$4Z=Y5.j1r/p+ha'9/O6Wp>%Wm7]JS,R?cafd3<#ZNeHI6E._HP<(=,=5[nbO'<]LVtJKk=[#ABQG7bCrqGh]SE!lD:-RBYJ$l;Viat5Gf(g!$ol>BYHn<eOE.d?b]e:X'M-!<Y%XLO)h'u\SLc,OsFcClBnkGG41!6"G6]4!pkNbIXZF%,WTi/6i<EU3`&>pp\o7]U=!mEKi$c74bMg-[DAE#bj1<tCIN&'10h='4G9SH?dQUVs()&n9PYMq4q%'ks6KWTjm"YFp^4@'i9+MnZ7.2F6_bU$TBEQ2T>X+:qqB[Y"SlllcPds7Nb:_-+g6QM?OdjX(E9/u>"0'hjq'n^%d"?Z(^S3ThdV7i(B>N0WeSIr:[EMAIL
PROTECTED];VPkFMf:Sa?B)284]F85arFFS&6']Jiq5"l>@<1mN.-m";Es:]<cj$RlT-P5O^$AO"tE*iS)[EMAIL
PROTECTED]&/)I]ZjE([EMAIL
PROTECTED]'Rf(+Q$.YGcfqIgl<A2)$?eL[Y.0Zl*+^$lWT]![R@/QfWi56;6<;RnMsR2^*`865X>S&i:D7(/ct&'[EMAIL
PROTECTED],G;0N..:KHf$*/j_&4I`[-LTu2GXe>!#(Dlr>67o#KN2BOF,6(=Mu,U\6805>S7r/aJd:u5gc(l0&MUXdOK\_VijnDg(-#E;OJ+3Qe.:cDX)$]_;i%&2S1?lC9*5^RN>Sp&O20Ll%,[EMAIL
PROTECTED],U/23\/`kj>DEpR3Gn7!B`L"Z#JAL=gem;M)Cc-"GJn\<<[EMAIL
PROTECTED])gX\,M@>rmb*\=4,7c+bPDT_\TN^1NXEjQNS[uA:26;5aNlZQGI't>BGpF#Lgo#:,)tbFgi7kbA')nB:hu+'N/&j^(u'<D8aBbS(6.tGL:['lj8*CbMp[6b7glVf_DPO8"&l,k-s&-iOg$e=PL0LqfOurpq?,4"=SDo%NBuDdAL2?o4N,U\bgKotN=M\N_(auei+;+ZdNe;.r/g^N-dGA!0Cg85pr.YB-j\9Fn!3eLF#sKm$L2'r:;([EMAIL
PROTECTED](t,lKjf9#tc\H/3=XhKFh5M/SCuZT[]&,SK'mU'5D3XZ!V%MARM_cAmg[Un-`V+UAf!PH2_+fP)_b<r'[EMAIL
PROTECTED])OCT/S972&8D8ZZIZIBrPK_OVe7;Zj>>'!FmnolucL*S#qY)>(.gl*[biI3VLrMTkh\tCGd_Vq"Yn2^s8%$7ICH^=1BK.JCFoBVuL,\Zs6-RZh[`,Xtiup1E1G_UPqp"M"j&QX^ec$W=/B,</7?t_lRDUOAoduOQfo<5*e0eEuP?(6AudHf825&U3^0["[EMAIL
PROTECTED]/&No8mnK_K_!Tp^JV"bBDP</Sj\\E5:i44:sQ5X&<h3,pFj.;QYOS`8h(a:90!$kGk'$uG5\o&@6NK7!*34`eh\I<2a$^cScNq[m7SkQCl*m0hpc%\BP8\bN3/ra*5'>iGrA\TT$F(037%=nE2t#8[`6OpHqe"TPap^0a.KiB!"LmG&"1iWWS[DmgAF:#l[0KH8n(r*s%qC:.;HK<0Dqcs[**,_f=NrG<?8'_*0H5N[l='5R7&[Me8^j<[EMAIL
PROTECTED]"p?iQqAL[;id,"[EMAIL PROTECTED]>~>
+Gau0DD/\/e&H;*)Taq6oC^htRUg=Dp_m1luL,Rjk%gBBM-uIA?<`r>n&hh4Qr(aP.*Gs5Ep4<@F.$;PsH1&4)?dC&`bU/r0/r;QaJ[=&p]p_>[EMAIL
PROTECTED]&RWSk*)Dbf<`Tk-?<esA[\,=E0`O;r1hR*U^0OWS_;$Ym:,Z9:<H5mb'2Sn(\.FOq;%J;G]81g,S\6#Q!/$(<`f7)ktU)4[_#TWiGh1/D^U/NO+7M>e,A:D3`";V<m>!1n#l8pW_6J/8[]Ppen]u"WVO&K.;Lr1Q.i8^*9rsAE'4i>hY%g&-f[paTr[`H*EPfi4u\aan\4O=;(>^YOlKrK[a3!Zg(a4t\;Ydr?hE'ENk:)MEP5b9Hpg<Y&pVZA&I'Lg_9VUsP"i9[?ED14%e_+i%*r"[EMAIL
PROTECTED](!_S%EqmPDN+75tj'&V;:,hX8S;<R%tCT*mO!YB)/A/sXnPUTc^)30a^T>BatLp'hW9KV#Z1p2=RpDSHcQMKZ.1^[QeY4M(H1Z;;p+:LPKQ[mr<"p.:$V>gHglF;p]J[9LUmpaqA/m$?Tnkfb"r8jEW^\5.r*9VqJo$Q=+Mfi;A0;d3CPCFa$\XMA\RSmDk2i-G>gHqTuGaXe%J0'fqDN_:l0%qKSD]8O"cF6rF99_Fl6]39qjI;lC<FlKg1.fkC9r_B,#+;3%c'K1haC2[->S&e1?-&;2OGS13AXreq).g>@]&1/glO'DOY;@5.);>;[?HZ,V#$IrXnneH]$X2t.p]tH8J-\A!Vc`/[EMAIL
PROTECTED])%Shmf*<KEZsUT'3=A</[EMAIL
PROTECTED]:I01LG<0&a1(]5"W)8MNiG8[.fHb6d7Z:>[EMAIL
PROTECTED]/q,X(G\;g'Xbc8aXlF(?'(L]eZh<*E4ktb+)H(fVq!-`#IKNBRpXXM8#p8<BfI]<cj<ZQVCN6OBXZZ#)Pb3NH8*2c]Ie,gh]r(nOOrbGa/\nIOncM&/'2k["XE3Y`c<9)pBo,\l>&`+Q&>%nQO-%\/dS\)9ks&@[EMAIL
PROTECTED]<mAb[%LE5lr[Z,W7OM:p`-<T:[EMAIL
PROTECTED]>S0i:D7(/ct&'-heOL"2bCkYH`;PWS(tLUh>$!,G;/cPkQh5BBEp:i/[EMAIL
PROTECTED];[EMAIL PROTECTED])#Gc0npYV'[EMAIL
PROTECTED]::M#[CeFsM33l0JS%P)*Ds,RHGFXOA4m(<a,RC]4lKF'WQAZ!!t.9O8's"RYXbRGS+cEU5<+W+k#9-!fOH)F&I&9^c2Lu5/^I%#WjK24SXd,[EMAIL
PROTECTED]/k\9k9;[EMAIL
PROTECTED]'!chVBG>ORb#%nj3o3<<kXC;;aD*Bd)L92q'\\N1)!10SDAJD(cn.:P6.B45sGmd40h2X$Ao\u=Np(M':1T8:V_YJi'ZZUk8G+oF>^"[EMAIL
PROTECTED]/[EMAIL
PROTECTED]"9%$+RIl@&ET%HEa[B8%*/b]]PKe_YC[.d([`QEDhdVd*,mbnmLt8oWRHbr%gP_ro&_*;VI%V^]].-T+24FFN=^K^AZ^/_#&_0SA%qjlga(Dj!/YD.6:$bRR.=+9d6dU9eEqoYs7d/8^H`uq]:S^pdiea[aS7r]/2s8OCr4dU+L([EMAIL
PROTECTED]@)oe(hnj$)8Y^f#h_YIVrf_<7T,h_3uS!$C7q&fbsm.1CWVE>,.RhN8.QbVXHZCh!"":V`kDm]C0rEj'mb8cNsdA%%.F4O#V*<4$ipXRGd)sn&nq7QE!p;i[b.,ZD?&`2X$fnN(nq\pdT>6LN,1PndK`a``0=1R(Tr/(,/nN+*dF,ZZEp-BVGMQd+B\MLd:QQ#+R*c;rU%*PP/MoB&/J?3fY]d[8:f>[EMAIL
PROTECTED]<+s#bg](gZfCE<@$iqkuc`.,(%>LR?G`$\_5F"P#2Xs&ad.:On&e3f_bX,74bC"^I]B7k#!rj9m>g*W:[EMAIL
PROTECTED](fu-T[9J#[a9>nbc^4<*QKZRf4#b8BaYSO/mF$BkkjGV#>#hT8+8[I*gu$16#s\~>
endstream
endobj
43 0 obj
@@ -297,10 +297,10 @@
>>
endobj
46 0 obj
-<< /Length 1728 /Filter [ /ASCII85Decode /FlateDecode ]
+<< /Length 1777 /Filter [ /ASCII85Decode /FlateDecode ]
>>
stream
-GauHLgQ(#H&:O:SW'[EMAIL
PROTECTED];3q92a9LHE7BpR=7f1Oa&DEjqTsEnI'%]R6Dsun%%;!]UU1b-H,>t$LcHM\4,<sC!hp*Qu:2CQSPkrtrnFYN^[#&XFSd0aN?9R1m+gIH#8S^NcC%<sI9HQW[<c;TnM%-c?ZJcn5)G(t]T3AR\p"T.%:8_f//ClbQ`3.s'[EMAIL
PROTECTED];9426ReP.=>]""TjSdAPTsdbqP1m?mA,U;i^)pI"!tt+CC<[EMAIL
PROTECTED](O+iB',Y:3=NB%"G_oA'KXApqCFj>pI1aL9eNQG(5b:t;d8DNaNQJ2\^MM$bX7s+;U,b0q\'WFXsIuiZj;`K3d(1dpnFQJ'@Gi1L6IuqWLPnV.pQeS?/A\Zlr%';9bVU:maenNKOT+aHQ;!S3-Je?MtpB8E\cMrl+%Z'd^9]XKKN*'C0QC8EO(^0oE1qXg5AgMt0B0<E]&8Te[!b1!U;YkTnFXtDa/Rfgq4U?sgN53&*EE`"XS\4eE#LJc6^EN*Y*EK2_>bpTE<ifh,qXpLpi\ZgJ@<oEZlJRhD;9.`1GDJalj-!&?V#AoQ9'[EMAIL
PROTECTED]&[EMAIL PROTECTED](&'FWVL_I6?mSU`dQjO5efa'H1]rEoT"[EMAIL
PROTECTED];U;KG55_9j"^ln5dR:/#0_B*rOC?)P`J2CMs-ieW3IskT<Rf">)`Eq$%:eoYL):!",[agW7MM-CRLnncfJ'"[EMAIL
PROTECTED]>bVBgE$Q*"TT`6`:#^lB>JiLmC%QqJUUmcL!pp.F&[Z!"`Pq[/N&dJ(>jm3JO9FDd&9E@)W'?m)8:m&'H<P`I'V"LCF?9IigA:K<Fu]7u>$I=;g7&c&[EMAIL
PROTECTED]:n-m-dCL'q[R"e:6BVFNI$7,2+6QXFH1+uIjjG*"/\JS->c6o/h7:-rkrRc+Z^/,PQ=`%P's!rr`XaSV=ipIhS?0-JhlOW4Idq2G)S,U0Wh'.'A6Jqg"[EMAIL
PROTECTED]:]/R$L0(Op*!_f<f7^P,&jUWS^NIeV%WSYd[_$sk4dNbF^pV\3SI!3\lN2?<>(:<%1!PWI#B:rsqkD5TDEeU3k%GDpta;7F`W$u==eLisU]p/%?ng_d_YJ^EZ,]Zp%gUZK(E-_>%pF=Zu/cI<%XdG-Xp9G=fFagA"mN$_7,cdTl+Yn;dNk!jN*N'g5I]"&q:7$?KOT23pi4eG%/G.?;s0au>+fS?b]_.qS;#L.(;[EMAIL
PROTECTED];9G)h>"QHKO0kg/STf_?9($*Ze?0(>QQ$/oQ<k"PpW"a#UeW,@Q;b-D<"TH\/[EMAIL
PROTECTED]:_hlaslUWV%^&ciX_9GB3N4QbSX>%V:2#2%([EMAIL
PROTECTED])5,8A3N.]'V+-XLq+$1kskKM,\6jRFif^]6>+/BdspQfb_>oeC)E?S:R?Up.1[q"qHT3KRVBUpMjX6)!:[EMAIL
PROTECTED]'is=WKj&[EMAIL
PROTECTED]:hf*[ZrhNZtfSlrrr0K8<0RO(.p07RCVa5YXc;I<=Yhh#IU%4'bldDSq0\^Q!dkfH0Yb*"L%`*G(*8o[?-QuBoW'>RN+/m350&F!h0>:G<,LgOE-_l4Dk=AU)cJUWU2m-=6lU4o`Caqr<cYcpT'T`R`/Lp6?rI1Q3D4Bg3KcK/V6l$.7%FfB[A;FgGrH\C8d:-l(dq_q=<tl~>
+Gat%$gQ(#H&:O:SW'[EMAIL PROTECTED]"p6P;4AMIP%HNV9%p&DEjqTsElS<[EMAIL
PROTECTED]<1Aj7n[J'TO""_4783F9Ck`KX1P#QjcOA:[EMAIL
PROTECTED],&g:lg2l*2LcNiXSue9Sl(S\*ef*fCF*O[6KTkChB+D:[EMAIL
PROTECTED]<[Z)L%_a!+cJM3Sn^^Q=^X#IcM:TJgg>8&:cDBQ0G;QiO%QdU^;dl[g\aO??^lqDXUE7c)-?KSkf2!<JfUp)jUN9]@'&ts9HF))2/VcE3f'FK5,GUK`TeGG^d'0o]qEKALY_qMs^mr.XDZ2_&>*W\jP'n3*(.En#I*o!s_0PZC"bMPdpj.OO.fkq?e%f96R"'s#;[EMAIL
PROTECTED]'QIu.dbG>QK>./]:RJ)EiJ,4a*O&F?Pq#:"maXdnA2p3#4O"Q<&!khZmuAQF*qRE=Jj;]e"&2=<#c.FnXi^4k5qh\^HQQu9Bth-2P_^3.8nlg.bbU;'iMu7CLS5ZiX&;$a*F4KI>[EMAIL
PROTECTED]/J]G>++1V;K.Q"Kq!tA134kICDJ,b#ulB2/U0s"[EMAIL
PROTECTED]:9qH$oLNIM<[EMAIL
PROTECTED]'t)%[[8K9hEA,m8pbm>8.!3J)RVf1otuCfmYVo0sI^7gf9Lk[`7W>Usa5p#d6[W'q+iFJ^X919'tM\l/=SBq6lHp40?1Ae9q%bsN\$=:V=d?HFo=l2a+i/1+55'u_:Q^-A1>6pm'^NM,[EMAIL
PROTECTED]&$cNAH9r8tO$9ct</aBCfoYLN25),g+/;UZ%(gjpO`tMpC)[EMAIL
PROTECTED]'k1\%_i?L;Z_&4:J_EedTdO9W(rpTHh($-NR.2\j_n:_P!_*.b`J6<O='EDn/Wt0AqE?-DO$mFeH"L%([EMAIL
PROTECTED]')</e]MgHc4%iT`'$0T8X2nZ=XJ(fO-Co\87ED]L.7A+1hg0VV!MT1dfB!3G%-3rkRHRSA'[EMAIL
PROTECTED]<"#C"MTGHY2<1tm_oD<[EMAIL PROTECTED],[EMAIL PROTECTED]"[EMAIL
PROTECTED]&T-]X84s5o_PWGHIAB-ri)_"2MfII%[5)p?Xsas-p32<spfd]+k9c<j>qPr#&Mm$:=Mo?-/U#k&CrGDg(<6gJp#*Y'f_F4f!lIUCAbIJgU"2&`o^M;PM?/(Dq8k4=J;TO(YoT''6l9DHGdUZ5V"NP<tsZNkHDIUl55>4(*iY!AXaq^c]'PPHo8VFuPqkE:Xgd([EMAIL
PROTECTED]&>?re_Pa[Kfq!qUq&>[EMAIL
PROTECTED]):\(+VJr-mPoYq)SL9CR16r[uBSn15bnMN+9_A\CdmO.U0cDbS8DH]t&cnrB_(fh-^<u+]VAaAscC<>':$,!#?Wl;DL>7T'?+;kJBhH!04K5]6s+:7c`2lLd[]kcacds1q,`2JnmSH.HR[0pqU2%WZ:[bt7'+?e7!5o$I6$rp6+R78jrJf$r,_GcGKD4$<ed3-3n*%Q["So:0G(2_p6en-YhFBDso?IgCSV`9/22HpVS8FP_)I;IB$HU9(#Q]>(Ih4Y9O>(Ei?I]H]R1%*59e2t^$"]H'9":sd]'bd)`i[((A*eEZISh[?>[EMAIL
PROTECTED]@!%&l[5^F#*I]%I,p.n,gg"JaDB.hkVifqUEeFhMgn/=bO5!PLDKuT,?]W8eh6!lE.6"95Qe/fO_,s(@<_peu1;#;?HLrHL~>
endstream
endobj
47 0 obj
@@ -309,33 +309,18 @@
/MediaBox [ 0 0 612 792 ]
/Resources 3 0 R
/Contents 46 0 R
+/Annots 48 0 R
>>
endobj
48 0 obj
-<< /Length 422 /Filter [ /ASCII85Decode /FlateDecode ]
- >>
-stream
-Gar>B]5GM/']&SBgFX;#0,P,Cel6dg%TOV-g(#P$960CK`ejQdF^c^p<=J(r3"HMbPQZgG`[O6C'81rk^i;)bEL]I\dg-,Q48;pRA_&EkQHRb!i_]<5?,@DR:ZNZFhdiItgER7V(_+ArUbaJiq@:"H'_ulJ"&^]c*9!I$P')'oG6M4R3"[f2C$g?a"Eh=2FkQJ3V,:_Va7/,[EMAIL
PROTECTED]:d-]pZV$>KCEXKO&]HRp%>l<?HBrn!m%um:.".8_K5J6Ce:0P/aLOirMCJqlm"U`tkVC5W[g*6:B./ni=E0/;/Y)cG9T_?IaH)8Ns&%%+OBrAi8fC24oQW_t1AC28VlP^'3iu11R6qrk3]-iI"[EMAIL
PROTECTED]>W(3n(#L;2>LU021%b[/ZidLnj_3L#~>
-endstream
-endobj
-49 0 obj
-<< /Type /Page
-/Parent 1 0 R
-/MediaBox [ 0 0 612 792 ]
-/Resources 3 0 R
-/Contents 48 0 R
-/Annots 50 0 R
->>
-endobj
-50 0 obj
[
-51 0 R
+49 0 R
]
endobj
-51 0 obj
+49 0 obj
<< /Type /Annot
/Subtype /Link
-/Rect [ 141.336 660.8 244.02 648.8 ]
+/Rect [ 141.336 194.134 244.02 182.134 ]
/C [ 0 0 0 ]
/Border [ 0 0 0 ]
/A << /URI (http://localhost:8080/)
@@ -343,137 +328,137 @@
/H /I
>>
endobj
-53 0 obj
+51 0 obj
<<
/Title
(\376\377\0\61\0\40\0\122\0\145\0\161\0\165\0\151\0\162\0\145\0\155\0\145\0\156\0\164\0\163)
- /Parent 52 0 R
- /Next 54 0 R
+ /Parent 50 0 R
+ /Next 52 0 R
/A 9 0 R
>> endobj
-54 0 obj
+52 0 obj
<<
/Title
(\376\377\0\62\0\40\0\107\0\145\0\164\0\164\0\151\0\156\0\147\0\40\0\123\0\164\0\141\0\162\0\164\0\145\0\144)
- /Parent 52 0 R
- /Prev 53 0 R
- /Next 55 0 R
+ /Parent 50 0 R
+ /Prev 51 0 R
+ /Next 53 0 R
/A 11 0 R
>> endobj
-55 0 obj
+53 0 obj
<<
/Title
(\376\377\0\63\0\40\0\111\0\156\0\164\0\162\0\141\0\156\0\145\0\164\0\40\0\103\0\162\0\141\0\167\0\154\0\151\0\156\0\147)
- /Parent 52 0 R
- /First 56 0 R
- /Last 57 0 R
- /Prev 54 0 R
- /Next 58 0 R
+ /Parent 50 0 R
+ /First 54 0 R
+ /Last 55 0 R
+ /Prev 52 0 R
+ /Next 56 0 R
/Count -2
/A 13 0 R
>> endobj
-56 0 obj
+54 0 obj
<<
/Title
(\376\377\0\63\0\56\0\61\0\40\0\111\0\156\0\164\0\162\0\141\0\156\0\145\0\164\0\72\0\40\0\103\0\157\0\156\0\146\0\151\0\147\0\165\0\162\0\141\0\164\0\151\0\157\0\156)
- /Parent 55 0 R
- /Next 57 0 R
+ /Parent 53 0 R
+ /Next 55 0 R
/A 15 0 R
>> endobj
-57 0 obj
+55 0 obj
<<
/Title
(\376\377\0\63\0\56\0\62\0\40\0\111\0\156\0\164\0\162\0\141\0\156\0\145\0\164\0\72\0\40\0\122\0\165\0\156\0\156\0\151\0\156\0\147\0\40\0\164\0\150\0\145\0\40\0\103\0\162\0\141\0\167\0\154)
- /Parent 55 0 R
- /Prev 56 0 R
+ /Parent 53 0 R
+ /Prev 54 0 R
/A 17 0 R
>> endobj
-58 0 obj
+56 0 obj
<<
/Title
(\376\377\0\64\0\40\0\127\0\150\0\157\0\154\0\145\0\55\0\167\0\145\0\142\0\40\0\103\0\162\0\141\0\167\0\154\0\151\0\156\0\147)
- /Parent 52 0 R
- /First 59 0 R
- /Last 63 0 R
- /Prev 55 0 R
+ /Parent 50 0 R
+ /First 57 0 R
+ /Last 61 0 R
+ /Prev 53 0 R
/Count -5
/A 19 0 R
>> endobj
-59 0 obj
+57 0 obj
<<
/Title
(\376\377\0\64\0\56\0\61\0\40\0\127\0\150\0\157\0\154\0\145\0\55\0\167\0\145\0\142\0\72\0\40\0\103\0\157\0\156\0\143\0\145\0\160\0\164\0\163)
- /Parent 58 0 R
- /Next 60 0 R
+ /Parent 56 0 R
+ /Next 58 0 R
/A 21 0 R
>> endobj
-60 0 obj
+58 0 obj
<<
/Title
(\376\377\0\64\0\56\0\62\0\40\0\127\0\150\0\157\0\154\0\145\0\55\0\167\0\145\0\142\0\72\0\40\0\102\0\157\0\157\0\163\0\164\0\162\0\141\0\160\0\160\0\151\0\156\0\147\0\40\0\164\0\150\0\145\0\40\0\127\0\145\0\142\0\40\0\104\0\141\0\164\0\141\0\142\0\141\0\163\0\145)
- /Parent 58 0 R
- /Prev 59 0 R
- /Next 61 0 R
+ /Parent 56 0 R
+ /Prev 57 0 R
+ /Next 59 0 R
/A 23 0 R
>> endobj
-61 0 obj
+59 0 obj
<<
/Title
(\376\377\0\64\0\56\0\63\0\40\0\127\0\150\0\157\0\154\0\145\0\55\0\167\0\145\0\142\0\72\0\40\0\106\0\145\0\164\0\143\0\150\0\151\0\156\0\147)
- /Parent 58 0 R
- /Prev 60 0 R
- /Next 62 0 R
+ /Parent 56 0 R
+ /Prev 58 0 R
+ /Next 60 0 R
/A 25 0 R
>> endobj
-62 0 obj
+60 0 obj
<<
/Title
(\376\377\0\64\0\56\0\64\0\40\0\127\0\150\0\157\0\154\0\145\0\55\0\167\0\145\0\142\0\72\0\40\0\111\0\156\0\144\0\145\0\170\0\151\0\156\0\147)
- /Parent 58 0 R
- /Prev 61 0 R
- /Next 63 0 R
+ /Parent 56 0 R
+ /Prev 59 0 R
+ /Next 61 0 R
/A 27 0 R
>> endobj
-63 0 obj
+61 0 obj
<<
/Title
(\376\377\0\64\0\56\0\65\0\40\0\123\0\145\0\141\0\162\0\143\0\150\0\151\0\156\0\147)
- /Parent 58 0 R
- /Prev 62 0 R
+ /Parent 56 0 R
+ /Prev 60 0 R
/A 29 0 R
>> endobj
-64 0 obj
+62 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F3
/BaseFont /Helvetica-Bold
/Encoding /WinAnsiEncoding >>
endobj
-65 0 obj
+63 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F5
/BaseFont /Times-Roman
/Encoding /WinAnsiEncoding >>
endobj
-66 0 obj
+64 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F6
/BaseFont /Times-Italic
/Encoding /WinAnsiEncoding >>
endobj
-67 0 obj
+65 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F1
/BaseFont /Helvetica
/Encoding /WinAnsiEncoding >>
endobj
-68 0 obj
+66 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F9
/BaseFont /Courier
/Encoding /WinAnsiEncoding >>
endobj
-69 0 obj
+67 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F2
/BaseFont /Helvetica-Oblique
/Encoding /WinAnsiEncoding >>
endobj
-70 0 obj
+68 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F7
@@ -482,19 +467,19 @@
endobj
1 0 obj
<< /Type /Pages
-/Count 6
-/Kids [6 0 R 31 0 R 41 0 R 43 0 R 47 0 R 49 0 R ] >>
+/Count 5
+/Kids [6 0 R 31 0 R 41 0 R 43 0 R 47 0 R ] >>
endobj
2 0 obj
<< /Type /Catalog
/Pages 1 0 R
- /Outlines 52 0 R
+ /Outlines 50 0 R
/PageMode /UseOutlines
>>
endobj
3 0 obj
<<
-/Font << /F3 64 0 R /F5 65 0 R /F1 67 0 R /F6 66 0 R /F9 68 0 R /F2 69 0 R /F7
70 0 R >>
+/Font << /F3 62 0 R /F5 63 0 R /F1 65 0 R /F6 64 0 R /F9 66 0 R /F2 67 0 R /F7
68 0 R >>
/ProcSet [ /PDF /ImageC /Text ] >>
endobj
9 0 obj
@@ -554,52 +539,52 @@
27 0 obj
<<
/S /GoTo
-/D [47 0 R /XYZ 85.0 468.7 null]
+/D [47 0 R /XYZ 85.0 527.86 null]
>>
endobj
29 0 obj
<<
/S /GoTo
-/D [47 0 R /XYZ 85.0 322.407 null]
+/D [47 0 R /XYZ 85.0 381.567 null]
>>
endobj
-52 0 obj
+50 0 obj
<<
- /First 53 0 R
- /Last 58 0 R
+ /First 51 0 R
+ /Last 56 0 R
>> endobj
xref
-0 71
+0 69
0000000000 65535 f
-0000017789 00000 n
-0000017882 00000 n
-0000017974 00000 n
+0000017160 00000 n
+0000017246 00000 n
+0000017338 00000 n
0000000015 00000 n
0000000071 00000 n
0000000922 00000 n
0000001042 00000 n
0000001137 00000 n
-0000018119 00000 n
+0000017483 00000 n
0000001271 00000 n
-0000018182 00000 n
+0000017546 00000 n
0000001408 00000 n
-0000018248 00000 n
+0000017612 00000 n
0000001545 00000 n
-0000018314 00000 n
+0000017678 00000 n
0000001682 00000 n
-0000018380 00000 n
+0000017744 00000 n
0000001819 00000 n
-0000018445 00000 n
+0000017809 00000 n
0000001956 00000 n
-0000018511 00000 n
+0000017875 00000 n
0000002092 00000 n
-0000018577 00000 n
+0000017941 00000 n
0000002229 00000 n
-0000018642 00000 n
+0000018006 00000 n
0000002366 00000 n
-0000018708 00000 n
+0000018072 00000 n
0000002503 00000 n
-0000018772 00000 n
+0000018137 00000 n
0000002640 00000 n
0000005299 00000 n
0000005422 00000 n
@@ -613,40 +598,38 @@
0000006777 00000 n
0000009086 00000 n
0000009194 00000 n
-0000011418 00000 n
-0000011541 00000 n
-0000011568 00000 n
-0000011738 00000 n
-0000013559 00000 n
-0000013667 00000 n
-0000014181 00000 n
-0000014304 00000 n
-0000014331 00000 n
-0000018838 00000 n
-0000014502 00000 n
-0000014665 00000 n
-0000014860 00000 n
-0000015107 00000 n
-0000015345 00000 n
-0000015605 00000 n
-0000015843 00000 n
-0000016056 00000 n
-0000016406 00000 n
-0000016633 00000 n
-0000016860 00000 n
-0000017016 00000 n
-0000017129 00000 n
-0000017239 00000 n
-0000017350 00000 n
-0000017458 00000 n
-0000017564 00000 n
-0000017680 00000 n
+0000011358 00000 n
+0000011481 00000 n
+0000011508 00000 n
+0000011678 00000 n
+0000013548 00000 n
+0000013671 00000 n
+0000013698 00000 n
+0000018203 00000 n
+0000013873 00000 n
+0000014036 00000 n
+0000014231 00000 n
+0000014478 00000 n
+0000014716 00000 n
+0000014976 00000 n
+0000015214 00000 n
+0000015427 00000 n
+0000015777 00000 n
+0000016004 00000 n
+0000016231 00000 n
+0000016387 00000 n
+0000016500 00000 n
+0000016610 00000 n
+0000016721 00000 n
+0000016829 00000 n
+0000016935 00000 n
+0000017051 00000 n
trailer
<<
-/Size 71
+/Size 69
/Root 2 0 R
/Info 4 0 R
>>
startxref
-18889
+18254
%%EOF
Modified:
incubator/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial.xml
URL:
http://svn.apache.org/viewcvs/incubator/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial.xml?view=diff&r1=161951&r2=161952
==============================================================================
--- incubator/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial.xml
(original)
+++ incubator/nutch/trunk/src/site/src/documentation/content/xdocs/tutorial.xml
Tue Apr 19 11:58:12 2005
@@ -203,10 +203,6 @@
<p>Now the database has entries for all of the pages referenced by the
initial set.</p>
-<p>Next we run five iterations of link analysis on the database in order
-to prioritize which pages to next fetch:</p>
-<source>bin/nutch analyze db 5
-</source>
<p>Now we fetch a new segment with the top-scoring 1000 pages:</p>
<source>bin/nutch generate db segments -topN 1000
s2=`ls -d segments/2* | tail -1`
@@ -214,7 +210,6 @@
bin/nutch fetch $s2
bin/nutch updatedb db $s2
-bin/nutch analyze db 2
</source>
<p>Let's fetch one more round:</p>
<source>
@@ -224,7 +219,7 @@
bin/nutch fetch $s3
bin/nutch updatedb db $s3
-bin/nutch analyze db 2</source>
+</source>
<p>By this point we've fetched a few thousand pages. Let's index
them!</p>
-------------------------------------------------------
This SF.Net email is sponsored by: New Crystal Reports XI.
Version 11 adds new functionality designed to reduce time involved in
creating, integrating, and deploying reporting solutions. Free runtime info,
new features, or free trial, at: http://www.businessobjects.com/devxi/728
_______________________________________________
Nutch-cvs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-cvs