Hello Gafur,

On 04.12.2012 05:35, [email protected] wrote:
> Hi,
>
> is it possible (in an appropriate way, I'm not familiar with the Wikipedia 
> extraction framework), or what is the easiest way, to get the number of 
> links from one DBpedia instance to another? The Wikipedia Pagelinks dump 
> only tells you that one DBpedia instance links to another DBpedia 
> instance; the cardinality is missing. I would like to know how many times 
> one DBpedia instance links to another DBpedia instance.
>
> Do you know an easy way to do this?

There are several metrics you might want:

1. Via pagelinks, each DBpedia instance links to any other instance at 
most once, because the dataset cannot contain duplicate triples.
2. You can count out-degree and in-degree, i.e. the number of pagelinks per 
instance as subject or object respectively.
Unix is your friend here (see 
http://code.google.com/p/aksw-commons/wiki/RDFStatistics):
wget http://downloads.dbpedia.org/3.8/en/page_links_en.nt.bz2
bzcat page_links_en.nt.bz2 | cut -f1 -d '>' | sed 's/<//;s/>//' | awk '{count[$1]++}END{for(j in count) print "<" j ">" "\t" count[j]}' > outdegree_subjects.tsv
-- or --
bzcat page_links_en.nt.bz2 | grep -v '"' | cut -f3 -d '>' | sed 's/<//;s/>//' | awk '{count[$1]++}END{for(j in count) print "<" j ">" "\t" count[j]}' > indegree_objects.tsv

3. For individual counts, see Mohamed's email.
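
To try the counting from point 2 without downloading the full dump, here is a minimal sketch that runs the same kind of pipeline on a tiny hand-made N-Triples file. The file names, the three sample triples and the predicate URI are assumptions made up for illustration and are not taken from the real dump. The final sort step and the duplicate check (which relates to point 1) are additions to the commands above.

```shell
# Hypothetical sample data: three page links in N-Triples form.
cat > sample_links.nt <<'EOF'
<http://dbpedia.org/resource/A> <http://dbpedia.org/ontology/wikiPageWikiLink> <http://dbpedia.org/resource/B> .
<http://dbpedia.org/resource/A> <http://dbpedia.org/ontology/wikiPageWikiLink> <http://dbpedia.org/resource/C> .
<http://dbpedia.org/resource/B> <http://dbpedia.org/ontology/wikiPageWikiLink> <http://dbpedia.org/resource/C> .
EOF

TAB="$(printf '\t')"

# Out-degree: count how often each URI occurs as subject (field 1),
# then sort by count (descending) and URI (ascending).
cut -f1 -d '>' sample_links.nt | sed 's/<//' |
  awk '{count[$1]++} END {for (j in count) print "<" j ">" "\t" count[j]}' |
  sort -t "$TAB" -k2,2nr -k1,1 > sample_outdegree.tsv

# In-degree: same idea for the object (field 3); lines with literals are skipped.
grep -v '"' sample_links.nt | cut -f3 -d '>' | sed 's/<//' |
  awk '{count[$1]++} END {for (j in count) print "<" j ">" "\t" count[j]}' |
  sort -t "$TAB" -k2,2nr -k1,1 > sample_indegree.tsv

# Point 1 check: number of triples that occur more than once
# (the count is 0 for this sample).
sort sample_links.nt | uniq -d | wc -l

cat sample_outdegree.tsv
cat sample_indegree.tsv
```

On the full dump, the same sort can be applied to outdegree_subjects.tsv or indegree_objects.tsv to list the instances with the most links first.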

All the best,
Sebastian




>
> best regards and thank you for replies!
> Gafur
>
>
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>


-- 
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig
Events:
* SWJ Special Issue for Multilingual LOD (*Deadline: Nov 23rd 2012*) - 
http://goo.gl/Bkwts
Projects: http://nlp2rdf.org , http://dbpedia.org
Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org

