I don't believe heavy load is a requirement. I'm pretty sure you can deadlock pretty easily if you try writing within an iterator.

Focus on Accismus would be best IMO, but, like Bill said, it's probably not fully there.

On 4/25/14, 11:42 PM, William Slacum wrote:
Our own Keith Turner is trying to make this possible with Accismus
(https://github.com/keith-turner/Accismus). I don't know the current
state of it, but I believe it's still in the early stages.

I've always been under the impression that launching a scanner or writer
from within an iterator, as it can cause deadlock in the system if it is
under heavy load.

  If it doesn't meet your needs, I'd recommend writing a daemon process
that identifies new documents via a scanner and filter, then write
indices for it. It's more network bound than doing it in an iterator,
but it's safer.



On Fri, Apr 25, 2014 at 11:29 PM, David Medinets
<[email protected] <mailto:[email protected]>> wrote:

    Can you change the ingest process to token on ingest?


    On Fri, Apr 25, 2014 at 10:45 PM, BlackJack76 <[email protected]
    <mailto:[email protected]>> wrote:

        Sure thing.  Basically, I am attempting to index a document.
          When I find the
        document, I want to insert the tokens directly back into the
        table.  I want
        to do it directly from the seek routine so that I don't need to
        return
        anything back to the client.

        For example, seek may locate the document that has the following
        sentence:

        The quick brown fox

         From there, I tokenize the document and want to insert the
        individual tokens
        back into tokens back into Accumulo (i.e., The, quick, brown,
        and fox all as
        separate mutations).



        --
        View this message in context:
        
http://apache-accumulo.1065345.n5.nabble.com/Write-to-table-from-Accumulo-iterator-tp9412p9414.html
        Sent from the Users mailing list archive at Nabble.com.



Reply via email to