You can use the Nutch API or the OpenSearch API.
This tutorial should give you some ideas to dig into:
http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.html
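For the OpenSearch route, here is a minimal sketch of reading search results from Java. It assumes (this is not confirmed anywhere in the thread) that the Nutch web application answers a request such as http://localhost:8080/opensearch?query=apache with an RSS 2.0 document whose items carry title, link and description elements. The class name OpenSearchSketch, the Hit type and the sample XML are hypothetical and exist only for illustration; only JDK classes are used.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class OpenSearchSketch {

    /** One search hit, holding the fields of a plain RSS 2.0 item. */
    public static final class Hit {
        public final String title;
        public final String link;
        public final String description;

        Hit(String title, String link, String description) {
            this.title = title;
            this.link = link;
            this.description = description;
        }
    }

    /** Parses an RSS 2.0 document and returns one Hit per item element. */
    public static List<Hit> parse(String rssXml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(rssXml)));
        NodeList items = doc.getElementsByTagName("item");
        List<Hit> hits = new ArrayList<>();
        for (int i = 0; i < items.getLength(); i++) {
            Element item = (Element) items.item(i);
            hits.add(new Hit(text(item, "title"),
                             text(item, "link"),
                             text(item, "description")));
        }
        return hits;
    }

    /** Returns the trimmed text of the first child with this tag, or "". */
    private static String text(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0
                ? ""
                : nodes.item(0).getTextContent().trim();
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample response; a real one would come from the webapp.
        String sample =
                "<rss version=\"2.0\"><channel><title>results</title>"
              + "<item><title>Example page</title>"
              + "<link>http://example.org/page</link>"
              + "<description>Some summary text</description></item>"
              + "</channel></rss>";
        for (Hit hit : parse(sample)) {
            System.out.println(hit.link + " | " + hit.title);
        }
    }
}
```

To use it against a running instance, fetch the response body over HTTP and pass it to parse(). If you need fields that the feed does not expose, the Nutch Java API is probably the better route.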
Saurabh Suman wrote:
Hi
I am new to Nutch. My Nutch search is working fine. I can see the results
through Tomcat. Http://loca
Can anyone point me at some Nutch Plugin documentation which goes into
more detail than
http://wiki.apache.org/nutch/WritingPluginExample-0.9
I want to understand all the different places where you might put a
plugin and why/how you might do so.
Basically I am trying to extract some information ou
Example1:
Error parsing:
http://localhost/mydocs/Programacion/Web/Ajax/Ajax.Hacks.Tips.and.Tools.for.Creating.Responsive.Web.Sites.Mar.2006.chm:
org.apache.nutch.parse.ParseException: parser not found for
contentType=chemical/x-chemdraw
url=http://localhost/mydocs/Programacion/Web/Ajax/Ajax.Ha
Hi all, how can I index .chm documents?
On Mon, Jul 6, 2009 at 3:10 PM, youyou wu wrote:
> Hi Susam,
>
> I have read your work about HttpAuthenticationSchemes and configured all the
> files according to what you said, but no authentication occurred.
> I can find the word "Credentials" but no "auth.AuthChallengeProcessor" in
> my logs. I
Hi Susam,
I have read your work about HttpAuthenticationSchemes and configured all the files
according to what you said, but no authentication occurred.
I can find the word "Credentials" but no "auth.AuthChallengeProcessor" in my
logs. I have also tried it in different websites requiring Auth
Hi,
I am using Nutch 1.0 with a 10-node cluster.
I have crawled 1000 sites to a depth of 10.
I got the following cluster summary:
5538 files and directories, 4556 blocks = 10094 total. Heap Size is 50 MB /
888.94 MB (5%)
Configured Capacity : 140 TB
DFS Used :
Hi
I am new to Nutch. My Nutch search is working fine. I can see the results
through Tomcat at http://localhost:8080.
I want to search the DB through Java code. For example, if I pass the query
"apache", I want the content, URL, and metadata back. How do I get this?