I don't believe heavy load is a requirement. I'm pretty sure you can
deadlock pretty easily if you try writing within an iterator.
Focus on Accismus would be best IMO, but, like Bill said, it's probably
not fully there.
On 4/25/14, 11:42 PM, William Slacum wrote:
Our own Keith Turner is trying to make this possible with Accismus
(https://github.com/keith-turner/Accismus). I don't know the current
state of it, but I believe it's still in the early stages.
I've always been under the impression that launching a scanner or writer
from within an iterator, as it can cause deadlock in the system if it is
under heavy load.
If it doesn't meet your needs, I'd recommend writing a daemon process
that identifies new documents via a scanner and filter, then write
indices for it. It's more network bound than doing it in an iterator,
but it's safer.
On Fri, Apr 25, 2014 at 11:29 PM, David Medinets
<[email protected] <mailto:[email protected]>> wrote:
Can you change the ingest process to token on ingest?
On Fri, Apr 25, 2014 at 10:45 PM, BlackJack76 <[email protected]
<mailto:[email protected]>> wrote:
Sure thing. Basically, I am attempting to index a document.
When I find the
document, I want to insert the tokens directly back into the
table. I want
to do it directly from the seek routine so that I don't need to
return
anything back to the client.
For example, seek may locate the document that has the following
sentence:
The quick brown fox
From there, I tokenize the document and want to insert the
individual tokens
back into tokens back into Accumulo (i.e., The, quick, brown,
and fox all as
separate mutations).
--
View this message in context:
http://apache-accumulo.1065345.n5.nabble.com/Write-to-table-from-Accumulo-iterator-tp9412p9414.html
Sent from the Users mailing list archive at Nabble.com.