Hi Gaurav, On 03/05/2013 09:51 AM, gaurav pant wrote: > Hi All, > > I want to extract Abstract from page_article dump using > dbpedia_extracter . But some of the pages there are no proper > abstract. Like some says redirecting to some other page. or some have > some other non-required information. > > Is there any possibility to get cleaner abstract. > > After analyzing I come across below. > > if <text>...</text> tag has #REDIRECT|#redirect than those are > redirected pages. > > If anyone has some other ideas than please suggest me.
The abstract extractor requires special handling to work properly. You can find details about how to get it working here [1]. > > Thanks [1] https://github.com/dbpedia/dbpedia/tree/master/abstractExtraction -- Kind Regards Mohamed Morsey Department of Computer Science University of Leipzig ------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_d2d_feb _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
