Thank's! I know! Very nice of you to help me!
------------------ ???????? ------------------ ??????: "Lewis John Mcgibbney"<[email protected]>; ????????: 2013??8??30??(??????) ????11:10 ??????: "[email protected]"<[email protected]>; ????: Re: How nutch2.2 to parse rss? yeah there is work to be done here for sure. there must be an issue open for this? On Thursday, August 29, 2013, Jonathan.Wei <[email protected]> wrote: > Thank's! > I try it! > But I have a felling that it will not build too! > Because some class file not find in nutch2.2! > example : > ParseData > ParseResult! > > > > > Thank you! > > > > > ------------------ ???????? ------------------ > ??????: "lewis john mcgibbney [via Lucene]"< [email protected]>; > ????????: 2013??8??30??(??????) ????9:34 > ??????: "????"<[email protected]>; > > ????: Re: How nutch2.2 to parse rss? > > > > Hi Jonathan, > This has been a long outstanding issue IIRC. > I have not used Nutch for feed crawling for a while if I am honest, and I > honestly can't recall when and if I have done it with 2.x. > You will see [0], that by default the plugin is not actually initialized. > So for starters you should uncomment the various targets within this file > [0] to get it working and to have it cleaned up etc. > You can then try building... but I have a feeling that it will not build. > Please check on our Jira for issues related to this... there may be patches > but I am not sure. > Kiran did some work a while back IIRC concerning getting following plugins > to compile and run > > <ant dir="feed" target="deploy"/> > <ant dir="parse-ext" target="deploy"/> > <ant dir="parse-swf" target="deploy"/> > <ant dir="parse-zip" target="deploy"/> > > But there is more work to be done. > Please keep us updated on this on. Sorry for late reply. > > [0] http://svn.apache.org/repos/asf/nutch/branches/2.x/src/plugin/build.xml > > > On Thu, Aug 29, 2013 at 1:29 AM, Jonathan.Wei <[hidden email]> wrote: > >> Hello!Every body! >> I want to use nutch2.2 to parse RSS ! >> But nutch2.x different with nutch1.x??So I down know how to parse >> rss!Can you help me? >> >> >> Use crawl command grab 24 URL, but the results suggest"Aborting with 10 >> hung >> threads." >> log content is : >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 466 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 461 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 455 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 450 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 444 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 439 0 kb/s, 13 >> URLs in 1 queues >> 0/10 spinwaiting/active, 11 pages, 0 errors, 0.0 0 pages/s, 434 0 kb/s, 13 >> URLs in 1 queues >> Aborting with 10 hung threads. >> >> What causes this? >> >> How I can fix it? >> >> Thank you! >> >> >> >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/How-nutch2-2-to-parse-rss-tp4087168.html >> Sent from the Nutch - User mailing list archive at Nabble.com. >> > > > > -- > *Lewis* > > > > If you reply to this email, your message will be added to the discussion below: > http://lucene.472066.n3.nabble.com/How-nutch2-2-to-parse-rss-and-Aborting-with-10-hung-threads-question-tp4087168p4087394.html > To unsubscribe from How nutch2.2 to parse rss? and "Aborting with 10 hung threads" question, click here. > NAML > > > > -- > View this message in context: http://lucene.472066.n3.nabble.com/How-nutch2-2-to-parse-rss-tp4087399.html > Sent from the Nutch - User mailing list archive at Nabble.com. -- *Lewis*

