On Fri, Mar 19, 2010 at 9:53 AM, Shrinivasan T <[email protected]>wrote:

> Hi,
>
> >
> > Is there any webpage layout detection tool available in FOSS !!
>
> What do you mean by this?
>
>

A web page will be having left pane, right pane etc.. . As like a wiki pedia
article page. I have to extract the article only . Not the content from left
pane etc. I downloaded the Tamil wikipedia html dump and some blog pages. I
have to extract content from this .

-- 
**********************************
JAGANADH G
http://jaganadhg.freeflux.net/blog
_______________________________________________
ILUGC Mailing List:
http://www.ae.iitm.ac.in/mailman/listinfo/ilugc

Reply via email to