[ https://issues.apache.org/jira/browse/RYA-500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16533683#comment-16533683 ]
ASF GitHub Bot commented on RYA-500:
------------------------------------

Github user ejwhite922 commented on a diff in the pull request:

    https://github.com/apache/incubator-rya/pull/299#discussion_r200360089

    --- Diff: extras/rya.manual/src/site/markdown/loaddata.md ---
    @@ -92,29 +92,56 @@ The default "format" is RDF/XML, but these formats are supported : RDFXML, NTRIP

     ## Bulk Loading data

    -Bulk loading data is done through Map Reduce jobs
    +Bulk loading data is done through Map Reduce jobs.

     ### Bulk Load RDF data

    -This Map Reduce job will read files into memory and parse them into statements. The statements are saved into the store. Here is an example for storing in Accumulo:
    +This Map Reduce job will read files into memory and parse them into statements. The statements are saved into the triplestore.
    +Here are the steps to prepare and run the job:
    +
    +  * Load the RDF data to HDFS. It can be single of multiple volumes and directories in them.
    --- End diff --

    The last sentence should be reworded. "It can be single or multiple volumes and can have directories in them."

> Make RdfFileInputTool to accept multiple input paths
> ----------------------------------------------------
>
>                 Key: RYA-500
>                 URL: https://issues.apache.org/jira/browse/RYA-500
>             Project: Rya
>          Issue Type: Improvement
>    Affects Versions: 3.2.12
>            Reporter: Maxim Kolchin
>            Priority: Trivial
>              Labels: mapreduce
>
> We store RDF files in multiple folders where each folder contains data about
> a specific type of entity (e.g. person, company, etc.). So it's not
> convenient that the RdfFileInputTool allows only a single input path.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)