Ok, thanks. I misunderstood the use of the term "load."
On Fri, Aug 9, 2013 at 12:43 PM, Andy Seaborne <[email protected]> wrote: > On 09/08/13 17:41, Brad Moran wrote: > >> I am trying to construct a lucene text index for the jena TDB that I >> already have built. From the documentation: >> http://jena.apache.org/**documentation/query/text-** >> query.html#building-a-text-**index<http://jena.apache.org/documentation/query/text-query.html#building-a-text-index> >> it seems I have to load the TDB using tdbloader(my TDB is quite large so I >> decided to use tdbloader2) the I have to run textindexer using an >> assembler >> file. Right now I am having trouble loading the TDB that needs to be >> indexed. >> > > You gave already loaded it. You don't need to load it again. > > Move on to the next step. > > (Rob's clarified the instructions: > > http://jena.staging.apache.**org/documentation/query/text-** > query.html#building-a-text-**index<http://jena.staging.apache.org/documentation/query/text-query.html#building-a-text-index> > > ) > > Andy > > >> >> On Fri, Aug 9, 2013 at 12:30 PM, Rob Vesse <[email protected]> wrote: >> >> Brad >>> >>> I still don't understand what you are trying to do here? >>> >>> If you already have a TDB dataset then you do not need to run tdbloader2 >>> >>> If you need to pass a TDB dataset to another Jena utility such as Fuseki >>> or the text indexer that expects a TDB location then you should simply >>> pass in the directory path to the directory where the TDB dataset is >>> located. >>> >>> Rob >>> >>> >>> >>> On 8/9/13 9:21 AM, "Brad Moran" <[email protected]> wrote: >>> >>> I tried doing this with all files in my TDB: >>>> >>>> jena-2.10.2/apache-jena-2.10.**2-SNAPSHOT/bin/tdbloader2 --loc >>>> NetBeansProjects/mdr-older/**trunk/tdb >>>> NetBeansProjects/mdr-older/**trunk/tdb/GOSP.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/GOSP.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/GPOS.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/GPOS.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/GSPO.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/GSPO.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/journal.jrnl >>>> NetBeansProjects/mdr-older/**trunk/tdb/node2id.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/node2id.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/nodes.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/nodes.dat-jrnl >>>> NetBeansProjects/mdr-older/**trunk/tdb/OSP.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/OSP.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/OSPG.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/OSPG.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/POS.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/POS.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/POSG.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/POSG.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefix2id.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefix2id.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefixes.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefixes.dat-jrnl >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefixIdx.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/prefixIdx.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/SPO.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/SPO.idn >>>> NetBeansProjects/mdr-older/**trunk/tdb/SPOG.dat >>>> NetBeansProjects/mdr-older/**trunk/tdb/SPOG.idn >>>> >>>> Then get: >>>> >>>> 11:58:38 -- TDB Bulk Loader Start >>>> 11:58:38 Data phase >>>> INFO Load: NetBeansProjects/mdr-older/**trunk/tdb/GOSP.dat -- >>>> 2013/08/09 >>>> 11:58:40 EDT >>>> Exception in thread "main" org.apache.jena.atlas.**AtlasException: >>>> java.nio.charset.**MalformedInputException: Input length = 1 >>>> at org.apache.jena.atlas.io.IO.**exception(IO.java:206) >>>> at >>>> org.apache.jena.atlas.io.**CharStreamBuffered$** >>>> SourceReader.fill(CharStreamBu >>>> ffered.java:79) >>>> at >>>> org.apache.jena.atlas.io.**CharStreamBuffered.fillArray(** >>>> CharStreamBuffered.j >>>> ava:156) >>>> at >>>> org.apache.jena.atlas.io.**CharStreamBuffered.advance(** >>>> CharStreamBuffered.jav >>>> a:139) >>>> at org.apache.jena.atlas.io.**PeekReader.advanceAndSet(** >>>> PeekReader.java:251) >>>> at org.apache.jena.atlas.io.**PeekReader.init(PeekReader.**java:244) >>>> at org.apache.jena.atlas.io.**PeekReader.peekChar(** >>>> PeekReader.java:169) >>>> at org.apache.jena.atlas.io.**PeekReader.makeUTF8(** >>>> PeekReader.java:108) >>>> at >>>> org.apache.jena.riot.tokens.**TokenizerFactory.** >>>> makeTokenizerUTF8(TokenizerFa >>>> ctory.java:41) >>>> at org.apache.jena.riot.**RiotReader.createParser(** >>>> RiotReader.java:130) >>>> at org.apache.jena.riot.**RiotReader.parse(RiotReader.**java:115) >>>> at org.apache.jena.riot.**RiotReader.parse(RiotReader.**java:93) >>>> at org.apache.jena.riot.**RiotReader.parse(RiotReader.**java:66) >>>> at >>>> com.hp.hpl.jena.tdb.store.**bulkloader2.**CmdNodeTableBuilder.exec(** >>>> CmdNodeTabl >>>> eBuilder.java:163) >>>> at arq.cmdline.CmdMain.**mainMethod(CmdMain.java:101) >>>> at arq.cmdline.CmdMain.mainRun(**CmdMain.java:63) >>>> at arq.cmdline.CmdMain.mainRun(**CmdMain.java:50) >>>> at >>>> com.hp.hpl.jena.tdb.store.**bulkloader2.**CmdNodeTableBuilder.main(** >>>> CmdNodeTabl >>>> eBuilder.java:81) >>>> Caused by: java.nio.charset.**MalformedInputException: Input length = 1 >>>> at java.nio.charset.CoderResult.**throwException(CoderResult.** >>>> java:277) >>>> at sun.nio.cs.StreamDecoder.**implRead(StreamDecoder.java:**338) >>>> at sun.nio.cs.StreamDecoder.read(**StreamDecoder.java:177) >>>> at java.io.InputStreamReader.**read(InputStreamReader.java:**184) >>>> at java.io.Reader.read(Reader.**java:140) >>>> ... 17 more >>>> >>>> >>>> >>>> Is it possible that I only need certain files from my TDB directory? I >>>> am >>>> pretty sure my TDB is not malformed, because I have run a lot of queries >>>> on >>>> it successfully. >>>> >>>> >>>> >>>> On Thu, Aug 8, 2013 at 5:34 PM, Andy Seaborne <[email protected]> wrote: >>>> >>>> On 08/08/13 20:33, Brad Moran wrote: >>>>> >>>>> I am trying to use tdbloader2 on mac os 10.8.4 from command line. I >>>>>> have >>>>>> all my triples successfully loaded into a tdb stored in directory >>>>>> "tdb." I >>>>>> am sure it is successfully loaded because I can run any query on it >>>>>> successfully. So I try: >>>>>> >>>>>> apache-jena-2.10.1/bin/****tdbloader2 >>>>>> --loc=NetBeansProjects/****mdrolder/trunk/luceneIndexes >>>>>> NetBeansProjects/mdrolder/****trunk/tdb >>>>>> >>>>>> >>>>> >>>>> The command format is >>>>> >>>>> tdbloader2 --loc LOC DATAFILE DATAFILE2 DATAFILE3 .... >>>>> >>>>> The data comes from miles, not a directory. >>>>> >>>>> It does not take a directory and find all the files. You have to >>>>> enumerate the files. >>>>> >>>>> Andy >>>>> >>>>> >>>>> >>>>> And I get Exception in thread "main" org.apache.jena.atlas.** >>>>>> AtlasException: >>>>>> java.io.FileNotFoundException: NetBeansProjects/mdrolder/**** >>>>>> trunk/tdb >>>>>> (Is >>>>>> a >>>>>> directory). >>>>>> Which makes sense, I just do not understand how I am supposed to load >>>>>> the >>>>>> tdb though. Should I just use one of the files within the TDB? >>>>>> >>>>>> Thanks, >>>>>> Brad >>>>>> >>>>>> >>>>>> >>>>> >>> >>> >> >
