Hello,
There is an issue with SimpleWikiParser in the extraction framework
regarding template parsing. Strangely formatted templates like this one:
{{template | value |= }} are parsed as text nodes instead of template
nodes. Apart from preventing data extraction, this results in
incorrect abstracts on the Polish DBpedia. For example, on
http://pl.dbpedia.org/page/Agnieszka_Rylik the abstract contains infobox
parameter values.
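To make the expected behaviour concrete, here is a rough Python sketch of how I would expect such a template to be split into a title and parameters. This is not the framework's code: parse_template is a hypothetical helper, it handles only flat (non-nested) templates, and keeping "|= " as a parameter with an empty name is my assumption about a reasonable result.

```python
import re

# Hypothetical helper for illustration only; not part of the extraction framework.
TEMPLATE_RE = re.compile(r"^\{\{(.*)\}\}$", re.DOTALL)


def parse_template(text):
    """Return (title, params) for a flat template, or None if text is not a template."""
    match = TEMPLATE_RE.match(text.strip())
    if match is None:
        return None
    parts = match.group(1).split("|")
    title = parts[0].strip()
    params = {}
    position = 1
    for part in parts[1:]:
        key, sep, value = part.partition("=")
        if sep:
            # Named parameter. An empty name (as in "|= ") is kept as the key ""
            # (assumption) instead of making the whole template fall back to text.
            params[key.strip()] = value.strip()
        else:
            # Positional parameter: numbered from 1 in order of appearance.
            params[str(position)] = key.strip()
            position += 1
    return title, params


print(parse_template("{{template | value |= }}"))
```

The point is only that the odd "|= " parameter should not stop the whole thing from being recognised as a template node.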
BTW, I noticed a couple of issues when trying to report this issue.
1) I couldn't submit a bug on SourceForge at
https://sourceforge.net/tracker/?group_id=190976&atid=935520. I got
a permission denied error. Is there any reason to restrict bug reporting
to project members only?
2) I wanted to create a test case for it but I couldn't find any tests
for the parser part in the repository. Are there any?
Regards,
Piotr