Author: ogrisel
Date: Fri May 20 09:52:50 2011
New Revision: 1125301
URL: http://svn.apache.org/viewvc?rev=1125301&view=rev
Log:
make it possible to just copy and past this command for convenience
Modified:
incubator/stanbol/trunk/entityhub/indexing/dbpedia/README.md
Modified: incubator/stanbol/trunk/entityhub/indexing/dbpedia/README.md
URL:
http://svn.apache.org/viewvc/incubator/stanbol/trunk/entityhub/indexing/dbpedia/README.md?rev=1125301&r1=1125300&r2=1125301&view=diff
==============================================================================
--- incubator/stanbol/trunk/entityhub/indexing/dbpedia/README.md (original)
+++ incubator/stanbol/trunk/entityhub/indexing/dbpedia/README.md Fri May 20
09:52:50 2011
@@ -82,13 +82,16 @@ on DBpedia (page_links_en.nt.bz2). This
following command to get an file containing an ordered list of incomming
count and the local name of the entity.
- curl http://downloads.dbpedia.org/{version}/en/page_links_en.nt.bz2 \
+ curl http://downloads.dbpedia.org/3.6/en/page_links_en.nt.bz2 \
| bzcat \
| sed -e 's/.*<http\:\/\/dbpedia\.org\/resource\/\([^>]*\)> ./\1/' \
| sort \
| uniq -c \
| sort -nr > incoming_links.txt
+Note: replace "3.6" by the latest release version of DBpedia in the above
+command line to get up to date data.
+
Depending on the machine and the download speed for the source file the
execution of this command will take several hours.