Hi,
>> How can i merge all this files in order to have ALL informations about
one wikipedia-page in the same file ?
Are you using allmighty linux? Then you would have all the tools at
hand, in order to do a hash based split based on the subjects of the
triples.
with "bzcat *" you could merge ALL files (in a directory) into a single
stream.
(If you just want everything in a single huge file, youre done now)
Then you pipe the stream into a little script something like:
#split each line into subject and remainder
|while read subject rest; do
h=compute hash based on subject
# use the hash as a filename, and append our current line to that file
echo "$subject $rest" >> $h
done
|
This would cause all triples with same subject to end up in the same file.
Afterwards you could use "sort -u filename" to sort the lines in a file,
which would result in all triples with the same subject to be in
consecutive rows.
(the -u would remove potential duplicate rows)
Kind regards,
Claus
On 01/13/2011 07:11 PM, Dimitris Kontokostas wrote:
Yes, but there is one file for Ontology, one file for Titles, another
for abstracts... etc...
How can i merge all this files in order to have ALL informations about
one wikipedia-page in the same file ?
you can't ;)
unless you set up your own server
but you can download all the json's you want and store them on you
computer
cheers,
Jim
------------------------------------------------------------------------------
Protect Your Site and Customers from Malware Attacks
Learn about various malware tactics and how to avoid them. Understand
malware threats, the impact they can have on your business, and how you
can protect your company and customers by using code signing.
http://p.sf.net/sfu/oracle-sfdevnl
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
------------------------------------------------------------------------------
Protect Your Site and Customers from Malware Attacks
Learn about various malware tactics and how to avoid them. Understand
malware threats, the impact they can have on your business, and how you
can protect your company and customers by using code signing.
http://p.sf.net/sfu/oracle-sfdevnl
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion