Hi,

>> How can I merge all these files in order to have ALL information about
>> one Wikipedia page in the same file?


Are you using almighty Linux? Then you have all the tools at hand to do a hash-based split on the subjects of the triples.

With "bzcat *" you could merge ALL files (in a directory) into a single stream.
(If you just want everything in a single huge file, redirect that stream into a file and you're done.)

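Then you pipe the stream into a little script, something like:

bzcat * | while read -r subject rest; do
    # read has already split the line into subject and remainder

    # compute a hash based on the subject
    # (cksum modulo 256 is just one possible choice; any stable hash works)
    h=$(( $(printf '%s' "$subject" | cksum | cut -d' ' -f1) % 256 ))

    # use the hash as a filename, and append our current line to that file
    printf '%s %s\n' "$subject" "$rest" >> "$h"
done
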
This would cause all triples with the same subject to end up in the same file.
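
The hash-based split described above can also be done inside a single awk process, which may be noticeably faster on large dumps. This is only a rough sketch: the function name split_by_subject, the default of 256 buckets, and the simple multiply-and-add hash are my own assumptions, and it assumes one triple per line with the subject as the first whitespace-separated field.

```shell
# Hypothetical helper: read triples on stdin and append each line to the
# file <outdir>/<hash of subject>. Name, bucket count and hash are assumptions.
split_by_subject() {
    outdir=$1
    buckets=${2:-256}
    mkdir -p "$outdir"
    LC_ALL=C awk -v dir="$outdir" -v n="$buckets" '
        # build a character -> byte value lookup table (awk has no ord())
        BEGIN { for (i = 1; i < 256; i++) ord[sprintf("%c", i)] = i }
        {
            # simple multiply-and-add hash over the subject, reduced modulo n
            h = 0
            len = length($1)
            for (i = 1; i <= len; i++)
                h = (h * 31 + ord[substr($1, i, 1)]) % n

            # keep only one output file open at a time, to stay below
            # the open-file limit of the awk implementation
            f = dir "/" h
            if (f != prev) { if (prev != "") close(prev); prev = f }
            print >> f
        }'
}
```

Usage would be something like: bzcat *.bz2 | split_by_subject buckets 256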

Afterwards you could use "sort -u filename" to sort the lines in each file, so that all triples with the same subject end up in consecutive rows.
(The -u also removes any duplicate rows.)
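
To apply that sort step to every file produced by the split, a small loop is enough. Again just a sketch: the function name sort_buckets and the idea of keeping all split files in one directory are assumptions on my side. LC_ALL=C is set only to get a plain byte-wise ordering that does not depend on the locale.

```shell
# Hypothetical helper: sort every regular file in a directory in place,
# dropping duplicate lines along the way.
sort_buckets() {
    dir=$1
    for f in "$dir"/*; do
        # skip anything that is not a regular file (also covers an empty directory)
        [ -f "$f" ] || continue
        # -u removes duplicate lines; -o lets sort safely write back to its input file
        LC_ALL=C sort -u -o "$f" "$f"
    done
}
```

Usage would be something like: sort_buckets buckets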

Kind regards,
Claus



On 01/13/2011 07:11 PM, Dimitris Kontokostas wrote:

    Yes, but there is one file for Ontology, one file for Titles, another
    for abstracts... etc...
    How can I merge all these files in order to have ALL information about
    one Wikipedia page in the same file?


you can't ;)
unless you set up your own server

but you can download all the JSON files you want and store them on your computer

cheers,
Jim



_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
