Hi Gaurav,

On 03/05/2013 09:51 AM, gaurav pant wrote:
> Hi All,
>
> I want to extract abstracts from the page_article dump using
> dbpedia_extracter. But some of the pages have no proper
> abstract: some only redirect to another page, and others contain
> information that is not needed.
>
> Is there any way to get cleaner abstracts?
>
> After analyzing the dump, I came across the following:
>
> If the <text>...</text> tag starts with #REDIRECT or #redirect, then
> the page is a redirect page.
>
> If anyone has other ideas, please suggest them.

The abstract extractor requires special handling to work properly.
You can find details about how to get it working here [1].
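
For the redirect check you describe, a minimal Python sketch might look
like the following. It is not part of the extraction framework; the names
is_redirect and non_redirect_pages are made up for illustration, and it
assumes that a redirect page's wikitext begins with "#REDIRECT" in any
letter case and that each <page> element carries a <title> and a <text>.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: redirect pages start with "#REDIRECT" (case-insensitive),
# optionally preceded by whitespace.
REDIRECT_RE = re.compile(r"^\s*#REDIRECT\b", re.IGNORECASE)


def is_redirect(wikitext):
    """Return True if the wikitext looks like a redirect page."""
    return bool(wikitext) and REDIRECT_RE.match(wikitext) is not None


def _local(tag):
    """Strip an XML namespace prefix such as '{http://...}page'."""
    return tag.rsplit("}", 1)[-1]


def non_redirect_pages(xml_source):
    """Yield (title, text) for every page that is not a redirect.

    xml_source is a file name or a binary file object holding the dump.
    The dump is streamed, so large files are not loaded into memory at once.
    """
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if _local(elem.tag) != "page":
            continue
        title, text = None, ""
        for child in elem.iter():
            name = _local(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                text = child.text or ""
        if not is_redirect(text):
            yield title, text
        elem.clear()  # release the memory held by the processed page
```

This only filters out redirects; it does not clean the remaining wikitext
into a readable abstract, which is what the setup in [1] is for.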

>
> Thanks

[1] https://github.com/dbpedia/dbpedia/tree/master/abstractExtraction

-- 
Kind Regards
Mohamed Morsey
Department of Computer Science
University of Leipzig


_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
