Re: Admin Gui beta test (was Re: ATB: Heritrix)
Hi Stefan, did you find a solution? I'd really like to give the admin gui a try. Cheers Karsten PS: My offer to host that file is still open :-) Stefan Groschupf schrieb: I think it should be possible to put your binary at the Apache site, probably Doug will be the right person to talk to ... Have you tried attaching it to a Jira issue? The nutch -xxxtar.gz is 67MB. The maximum file upload size is 10.00 Mb . If that fails, you could attach it to a page on the Wiki, no? Is that a good idea? The file is that big and we already got the request to use the apache mirror servers. Anyway I already got some offline offers from people, just was thinking it is a good idea to leave that running under the nutch project flag.
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Sorry, I am on holiday until the 8th of May. Please contact the [EMAIL PROTECTED] for urgent matters. Kind regards, Herman.
RE: Admin Gui beta test (was Re: ATB: Heritrix)
Stefan - I can host the file at http://www.oaktreesecurity.com if you would like. I have about 2 gigs of bandwidth a month, and I use maybe 10 megs, I think I can accommodate. I am more than happy to host a free standing binary. Do you have a windows compatible version (or will it run in cygwin), or is it Linux only? r/d -Original Message- From: Stefan Groschupf [mailto:[EMAIL PROTECTED] Sent: Friday, April 28, 2006 6:24 AM To: nutch-user@lucene.apache.org Subject: Admin Gui beta test (was Re: ATB: Heritrix) Hi there, since building the gui is some how complicated I was thinking about providing a ready to use binary. This may be would help to get some more beta testers we currently looking for. Any thoughts? However I afraid that this would hit my server to hard and I have to pay for traffic. :-/ Does any one has an idea where we can mirror this file for free? Any volunteer is very welcome. Thanks. Stefan Am 28.04.2006 um 15:14 schrieb Aled Jones: Thanks for your replies guys. I hadn't realised that the admin gui was already in development. We should be able to cope till it gets released ;-) Thanks again Aled -Neges Wreiddiol-/-Original Message- Oddi wrth/From: Dan Morrill [mailto:[EMAIL PROTECTED] Anfonwyd/Sent: 28 April 2006 14:07 At/To: nutch-user@lucene.apache.org Pwnc/Subject: RE: Heritrix Aled, I used heritrix before going over to nutch, while it is an excellent program, with lots of good things to offer, it didn't quite meet my need, and when designing the architecture had too many dependencies for me to be comfortable with. If you want to run an internet archive though, heritrix can not be beat, if you want to run a search engine, nutch is a good choice. My personal opinion. r/d -Original Message- From: Aled Jones [mailto:[EMAIL PROTECTED] Sent: Friday, April 28, 2006 1:59 AM To: nutch-user@lucene.apache.org Subject: Heritrix Hi Anyone used Heritrix (http://crawler.archive.org/) as a crawler? How does it compare with the Nutch crawler? Can Nutch serve its crawled results? Main reason I'm interested is that it has a WUI interface that might make maintenance for the IT guys easier, although I know that some of you guys are working on an interface. Cheers Aled ### This message has been scanned by F-Secure Anti-Virus for Microsoft Exchange. For more information, connect to http://www.f-secure.com/ ** ** This e-mail and any attachments are strictly confidential and intended solely for the addressee. They may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, you must not copy the e-mail or the attachments, or use them for any purpose or disclose their contents to any other person. To do so may be unlawful. If you have received this transmission in error, please notify us as soon as possible and delete the message and attachments from all places in your computer where they are stored. Although we have scanned this e-mail and any attachments for viruses, it is your responsibility to ensure that they are actually virus free. ### This message has been scanned by F-Secure Anti-Virus for Microsoft Exchange. For more information, connect to http://www.f-secure.com/ ** ** This e-mail and any attachments are strictly confidential and intended solely for the addressee. They may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, you must not copy the e-mail or the attachments, or use them for any purpose or disclose their contents to any other person. To do so may be unlawful. If you have received this transmission in error, please notify us as soon as possible and delete the message and attachments from all places in your computer where they are stored. Although we have scanned this e-mail and any attachments for viruses, it is your responsibility to ensure that they are actually virus free. - blog: http://www.find23.org company: http://www.media-style.com
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Hi Stefan I would be willing to host the app. I have virutal dedicated server from Godaddy with Fedora core2 and apache webserver and tomcat running. The IP address is http://68.178.249.66 Right now, on webserver side, I have a default page (hosted by godaddy running) But can make sure the Admin GUI is running.. I might need some help, but should not be a problem at all. Thanks Sudhi Stefan Groschupf [EMAIL PROTECTED] wrote: Hi there, since building the gui is some how complicated I was thinking about providing a ready to use binary. This may be would help to get some more beta testers we currently looking for. Any thoughts? However I afraid that this would hit my server to hard and I have to pay for traffic. :-/ Does any one has an idea where we can mirror this file for free? Any volunteer is very welcome. Thanks. Stefan Am 28.04.2006 um 15:14 schrieb Aled Jones: Thanks for your replies guys. I hadn't realised that the admin gui was already in development. We should be able to cope till it gets released ;-) Thanks again Aled -Neges Wreiddiol-/-Original Message- Oddi wrth/From: Dan Morrill [mailto:[EMAIL PROTECTED] Anfonwyd/Sent: 28 April 2006 14:07 At/To: nutch-user@lucene.apache.org Pwnc/Subject: RE: Heritrix Aled, I used heritrix before going over to nutch, while it is an excellent program, with lots of good things to offer, it didn't quite meet my need, and when designing the architecture had too many dependencies for me to be comfortable with. If you want to run an internet archive though, heritrix can not be beat, if you want to run a search engine, nutch is a good choice. My personal opinion. r/d -Original Message- From: Aled Jones [mailto:[EMAIL PROTECTED] Sent: Friday, April 28, 2006 1:59 AM To: nutch-user@lucene.apache.org Subject: Heritrix Hi Anyone used Heritrix (http://crawler.archive.org/) as a crawler? How does it compare with the Nutch crawler? Can Nutch serve its crawled results? Main reason I'm interested is that it has a WUI interface that might make maintenance for the IT guys easier, although I know that some of you guys are working on an interface. Cheers Aled ### This message has been scanned by F-Secure Anti-Virus for Microsoft Exchange. For more information, connect to http://www.f-secure.com/ ** ** This e-mail and any attachments are strictly confidential and intended solely for the addressee. They may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, you must not copy the e-mail or the attachments, or use them for any purpose or disclose their contents to any other person. To do so may be unlawful. If you have received this transmission in error, please notify us as soon as possible and delete the message and attachments from all places in your computer where they are stored. Although we have scanned this e-mail and any attachments for viruses, it is your responsibility to ensure that they are actually virus free. ### This message has been scanned by F-Secure Anti-Virus for Microsoft Exchange. For more information, connect to http://www.f-secure.com/ ** ** This e-mail and any attachments are strictly confidential and intended solely for the addressee. They may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, you must not copy the e-mail or the attachments, or use them for any purpose or disclose their contents to any other person. To do so may be unlawful. If you have received this transmission in error, please notify us as soon as possible and delete the message and attachments from all places in your computer where they are stored. Although we have scanned this e-mail and any attachments for viruses, it is your responsibility to ensure that they are actually virus free. - blog: http://www.find23.org company: http://www.media-style.com Sudhi Seshachala http://sudhilogs.blogspot.com/ - Yahoo! Mail goes everywhere you do. Get it on your phone.
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Stefan Groschupf wrote: Hi there, since building the gui is some how complicated I was thinking about providing a ready to use binary. This may be would help to get some more beta testers we currently looking for. Any thoughts? However I afraid that this would hit my server to hard and I have to pay for traffic. :-/ Does any one has an idea where we can mirror this file for free? Any volunteer is very welcome. I think it should be possible to put your binary at the Apache site, probably Doug will be the right person to talk to ... -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Sorry, I am on holiday until the 8th of May. Please contact the [EMAIL PROTECTED] for urgent matters. Kind regards, Herman.
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Andrzej Bialecki wrote: I think it should be possible to put your binary at the Apache site, probably Doug will be the right person to talk to ... Have you tried attaching it to a Jira issue? If that fails, you could attach it to a page on the Wiki, no? Doug
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Sorry, I am on holiday until the 8th of May. Please contact the [EMAIL PROTECTED] for urgent matters. Kind regards, Herman.
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Sorry, I am on holiday until the 8th of May. Please contact the [EMAIL PROTECTED] for urgent matters. Kind regards, Herman.
Re: Admin Gui beta test (was Re: ATB: Heritrix)
what about putting it on sourceforge? http://sourceforge.net/projects/nutch - Original Message - From: Doug Cutting [EMAIL PROTECTED] To: nutch-user@lucene.apache.org Sent: Saturday, April 29, 2006 12:18 AM Subject: Re: Admin Gui beta test (was Re: ATB: Heritrix) Stefan Groschupf wrote: If that fails, you could attach it to a page on the Wiki, no? Is that a good idea? The file is that big and we already got the request to use the apache mirror servers. The Apache mirrors are really for signed, Apache releases, which this is not. Is it too big for the wiki? Doug
Re: Admin Gui beta test (was Re: ATB: Heritrix)
Sorry, I am on holiday until the 8th of May. Please contact the [EMAIL PROTECTED] for urgent matters. Kind regards, Herman.
Re: Admin GUI
Hi Daniel, thanks we still working on it. Actually we have to finish something behind the sense and than we will publish a kind of plugin extension point that will allow other people to contribute. Thanks for the offer, may be the only thing you can do is to vote for this issue since this is somehow related to meta data, nutch admin gui and nutch 'instances'. :-D http://issues.apache.org/jira/browse/NUTCH-204 Cheers, Stefan Am 23.02.2006 um 12:56 schrieb Daniel Färnbo: Hi Stefan! How is the progress, any updates? Need any help? http://wiki.apache.org/nutch/NutchAdministrationUserInterface /Daniel --- company:http://www.media-style.com forum:http://www.text-mining.org blog:http://www.find23.net
Re: Admin GUI
Hi Stefan The GUI looks great! My idea is to add ajax tech. to reduce the page reload and show the job progress in realtime. If contribution is welcome and no one is working on this, I'd like to take this. Regards /Jack On 2/23/06, Stefan Groschupf [EMAIL PROTECTED] wrote: Hi Daniel, thanks we still working on it. Actually we have to finish something behind the sense and than we will publish a kind of plugin extension point that will allow other people to contribute. Thanks for the offer, may be the only thing you can do is to vote for this issue since this is somehow related to meta data, nutch admin gui and nutch 'instances'. :-D http://issues.apache.org/jira/browse/NUTCH-204 Cheers, Stefan Am 23.02.2006 um 12:56 schrieb Daniel Färnbo: Hi Stefan! How is the progress, any updates? Need any help? http://wiki.apache.org/nutch/NutchAdministrationUserInterface /Daniel --- company:http://www.media-style.com forum:http://www.text-mining.org blog:http://www.find23.net -- Keep Discovering ... ... http://www.jroller.com/page/jmars
Re: Admin GUI
On 2/24/06, Stefan Groschupf [EMAIL PROTECTED] wrote: Hi Jack, The GUI looks great! I will forward this to Frank Henze he had done the design and sample. :) Thanks. I'll prepare some utility and debug javascript classes from now on:) My idea is to add ajax tech. to reduce the page reload and show the job progress in realtime. If contribution is welcome and no one is working on this, I'd like to take this. That is great idea, I like ajax very much. However actually our plan looks like: 1.) getting nutch to a status that a gui is possible 2.) writing a underlaying extension point api 3.) writing some basic plugins using this extension point 4.) getting things stabile 5.) getting things into the nutch sources - that could take some time 6.) improving ui with better graphics, ajax etc. So please stay patient, we may need some more time. Since I also get some off-list mails a short status update. The first point is pretty much done Marko will contribute a update until next days. I'm having a kind of alpha for for 2. and some very first steps for 3. are also already done. Anyway we need some more time, please stay patient. Stefan Regards /Jack On 2/23/06, Stefan Groschupf [EMAIL PROTECTED] wrote: Hi Daniel, thanks we still working on it. Actually we have to finish something behind the sense and than we will publish a kind of plugin extension point that will allow other people to contribute. Thanks for the offer, may be the only thing you can do is to vote for this issue since this is somehow related to meta data, nutch admin gui and nutch 'instances'. :-D http://issues.apache.org/jira/browse/NUTCH-204 Cheers, Stefan Am 23.02.2006 um 12:56 schrieb Daniel Färnbo: Hi Stefan! How is the progress, any updates? Need any help? http://wiki.apache.org/nutch/NutchAdministrationUserInterface /Daniel --- company:http://www.media-style.com forum:http://www.text-mining.org blog:http://www.find23.net -- Keep Discovering ... ... http://www.jroller.com/page/jmars - blog: http://www.find23.org company: http://www.media-style.com -- Keep Discovering ... ... http://www.jroller.com/page/jmars