On Apr 16, 2013 2:41 PM, "Shivani Poddar" <[email protected]>
wrote:
>
> JC,
>
> Thanks a lot for your prompt reply.
> Will fix the depreciated documentations in the updated wiki :)

Thank you!

>
> Thanks a lot,
> Shivani
>
>
>
> On Tue, Apr 16, 2013 at 6:08 PM, Jona Christopher Sahnwaldt <
[email protected]> wrote:
>>
>> That shell script is deprecated as well. Please use
https://github.com/dbpedia/extraction-framework/blob/master/scripts/src/main/scala/org/dbpedia/extraction/scripts/ProcessInterLanguageLinks.scalainstead.
Sorry for the inconvenience and thanks for your work and patience!
>>
>> On Apr 16, 2013 2:30 PM, "Shivani Poddar" <[email protected]>
wrote:
>>>
>>>
>>>
>>> Hi Dimitris,
>>>
>>> It is very encouraging to see such a responsive community :). I am very
excited to get through with as many tasks as I can (with each one I am more
confident about the project.
>>> I have been working on what you suggested and encountered the following
problems.
>>> Once I am through with the configuration etc I will be moving the
complete page http://dbpedia.org/Internationalization/Guide#h152-7 to the
new wiki along with the updated parts visa vi the deprecated ones.
>>>
>>>
>>>> You are right, this part is now deprecated. The configuration changed
last year and we didn't fix that part in the docs.
>>>> There is also a full page in github explaining the format syntax [1].
>>>>
>>>> As a warm up task you can setup the configurations for you language
and create a sample dump out of it.
>>>> Then you can update the documentation according to your experience.
I'd also suggest that you move the whole page to github and link to it from
the main (gihub) wiki page.
>>>>
>>>
>>>
>>> Trying to follow the directions given in
http://dbpedia.org/Internationalization/Guide#h152-7 by changing the
appropriate directories etc.
>>> I get the following error when I try to accomplish "4. Interlinking" :
>>>
>>> $ shivani
~/extraction-framework/scripts/shell-scripts/interwiki_links-->$ sh
interwiki_links.sh 'en' 'hi'
>>> sed: can't read
/home/shivani/extraction-framework/scripts/shell-scripts/interwiki_links/../../../dump/config.properties:
No such file or directory
>>> /en/interlanguage_links_en.nt not found! exiting...
>>> /hi/interlanguage_links_hi.nt not found! exiting...
>>>
-------------------------------------------------------------------------------
>>> Generating interlanguage links from en to hi
>>>
-------------------------------------------------------------------------------
>>> interwiki_links.sh: line 59: /hi/interlanguage_links_hi.nt.reversed.en:
No such file or directory
>>> grep: /hi/interlanguage_links_hi.nt: No such file or directory
>>> interwiki_links.sh: line 61: /en/interlanguage_links_en.nt.sorted.hi:
No such file or directory
>>> grep: /en/interlanguage_links_en.nt: No such file or directory
>>> interwiki_links.sh: line 64: /en/sameas-hi-en.nt: No such file or
directory
>>> wc: /en/sameas-hi-en.nt: No such file or directory
>>>
>>>
>>>  I speculate that this should be because of faulty configuration of As
indicated in the old wiki (/dump/extract.properties) ->As should be in the
new wiki (/dump/extraction.iri.same.as.uri.properties)
>>> in the later section of "2. Encoding / resource namespace / titles"
>>>
>>> It would be great if you could cite the respective fixes for this.
>>>
>>>
>>> Thanks a lot !
>>> Shivani
>>>
>>>
>>>>
>>>> Cheers,
>>>> Dimitris
>>>>
>>>> [1]
https://github.com/dbpedia/extraction-framework/wiki/Input-File-Format-In-DBpedia-Extraction-Framework
>>>>
>>>>
>>>> On Tue, Apr 16, 2013 at 4:50 AM, Shivani Poddar <
[email protected]> wrote:
>>>>>
>>>>> Hi,
>>>>> The following page might have a couple of errors which I encountered
while setting up the codebase to begin contributing for the "Design a
better / interactive display page." project :
>>>>>
>>>>> http://dbpedia.org/Internationalization/Guide#h152-7
>>>>>
>>>>>  The second heading "2. Encoding / resource namespace / titles"
directs the user at changing the following :
>>>>>
>>>>> [extraction_framework/core/src/main/scala]
>>>>> org.dbpedia.extraction.util.Language.scala
>>>>>
>>>>>      // default: no language use generic domain
>>>>>      val generic = Set[String]()
>>>>>
>>>>>      // change to this if language xx should be extracted using the
generic domain
>>>>>      val generic = Set("xx")
>>>>>
>>>>> Here the file name is not org.dbpedia.extraction.util.Language.scala,
but the file path is
"extraction-framework/core/src/main/scala/org/dbpedia/extraction/util/Language.scala"
>>>>>
>>>>> secondly the refereed variables cannot be located in the file.
>>>>> Are they supposed to be created ??
>>>>>
>>>>>
>>>>> Same for the dump/extraction.default.properties file.
>>>>> It is suggested that the value of the format variable be adjusted ,
while the file already has settings like
>>>>>
>>>>> 105 # NT is unreadable anyway - might as well use URIs for en
>>>>> 106 format.nt.gz=n-triples;uri-policy.uri
>>>>> 107 format.nq.gz=n-quads;uri-policy.uri
>>>>> 108
>>>>> 109 # Turtle is much more readable - use nice IRIs for all languages
>>>>> 110 format.ttl.gz=turtle-triples;uri-policy.iri
>>>>> 111 format.tql.gz=turtle-quads;uri-policy.iri
>>>>>
>>>>> It would be helpful if the documentation is more specific. I could
tweak the documentation with the respective feedback here.
>>>>>
>>>>> Thank You,
>>>>> Shivani
>>>>>
>>>>>
>>>>>
------------------------------------------------------------------------------
>>>>> Precog is a next-generation analytics platform capable of advanced
>>>>> analytics on semi-structured data. The platform includes APIs for
building
>>>>> apps and a phenomenal toolset for data science. Developers can use
>>>>> our toolset for easy data analysis & visualization. Get a free
account!
>>>>> http://www2.precog.com/precogplatform/slashdotnewsletter
>>>>> _______________________________________________
>>>>> Dbpedia-gsoc mailing list
>>>>> [email protected]
>>>>> https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Dimitris Kontokostas
>>>> Department of Computer Science, University of Leipzig
>>>> Research Group: http://aksw.org
>>>> Homepage:http://aksw.org/DimitrisKontokostas
>>>
>>>
>>>
>>>
>>> --
>>> Shivani Poddar,
>>> Bachelors in Computer Sciences and MS in Exact Humanities, Sophomore
>>> International Institute of Information Technology, Hyderabad
>>>
>>>
------------------------------------------------------------------------------
>>> Precog is a next-generation analytics platform capable of advanced
>>> analytics on semi-structured data. The platform includes APIs for
building
>>> apps and a phenomenal toolset for data science. Developers can use
>>> our toolset for easy data analysis & visualization. Get a free account!
>>> http://www2.precog.com/precogplatform/slashdotnewsletter
>>> _______________________________________________
>>> Dbpedia-gsoc mailing list
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
>>>
>
>
>
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc

Reply via email to