Hi all,
we are students of the Hasso Plattner Institute and take part in a seminar of
the Semantic Web chair. Our task is to extract information from Wikipedia „List
of“ pages to DBpedia. Therefore we thought about using the
DBpediaExtractionFramework.
We have some questions regarding the framework:
We use IntelliJ for development. But if we place a breakpoint in the extraction
method of our extractor and debug our maven goal the debugger doesn’t stop. We
already tried out to use mvnDebug and attach to it using the RemoteDebugger
from IntelliJ. Is there anything we need to do, to debug the framework?
There is a dataset needed to be set for each Extractor, what is purpose of it?
Is there a way to add state to the extraction process or some static
information? It seems for us like the context object does something like that,
but we don’t really understand where the content comes from and how to add new
objects to it.
We also want to extract List_of pages which are in a table format. We found the
classes `Table Node, TableRowNode, TableCellNode` which we would like to use.
But if we extend `PageNodeExtractor` the tables don’t get wrapped in these
classes, but are just TextNodes and InternalLinkNodes. There is a class called
TableMapping, which looks handy, but we don’t know if and how we could use it.
Is there a way to do after processing of the results?
Thanks in advance for answering all the questions.
Kind Regards,
Patrick Kuhn, Fabian Windheuser, Sven Mischkewitz and Nico Ring
------------------------------------------------------------------------------
One dashboard for servers and applications across Physical-Virtual-Cloud
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y
_______________________________________________
Dbpedia-developers mailing list
Dbpedia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers