I mentioned those as examples only. I am not asking to do anything repository 
specific. In the first case I want to post the plain text of a file to an out 
put as-is with no wrappers added by manifold and no encoding.

In the second case I want to merge some properties on a document with the 
existing JSON from the file content and then post that to the output as-is.

Jeff



> On Aug 27, 2015, at 3:58 PM, Karl Wright <[email protected]> wrote:
> 
> Hi,
> 
> ManifoldCF's connectors are general purpose; they are intended to work with 
> *any* repository or output.  So in general, connectors in MCF cannot 
> interpret or generate content that is Alfresco or ElasticSearch specific.
> 
> You are welcome to convert these documents to ManifoldCF's means of managing 
> documents, RepositoryDocument, in your repository connector, and then convert 
> them back in your output connector.  Or, if you want to write specific 
> proprietary connectors that communicate in a specific format of JSON, you 
> can.  But do not expect ManifoldCF's suite of other connectors and 
> transformers to work with this in any meaningful way.
> 
> Karl
> 
> 
>> On Thu, Aug 27, 2015 at 4:29 PM, Shinichiro Abe <[email protected]> 
>> wrote:
>> Hi,
>> 
>> I’m work in progress at https://issues.apache.org/jira/browse/CONNECTORS-1234
>> 
>> Regards,
>> Shinichiro Abe
>> 
>> > 2015/08/28 4:47、Jeff Potts <[email protected]> のメール:
>> >
>> > I've spent a very short time playing with ManifoldCF. Cool project, thank 
>> > you for contributing it.
>> >
>> > I can read binary files from a source repo like Alfresco 5.0.d and post 
>> > them to Elasticsearch 1.7.2 successfully.
>> >
>> > Now I'm wondering if the rest of my use cases can be achieved with 
>> > ManifoldCF...
>> >
>> > Use case 1: Read JSON from a file system, post to Elasticsearch as-is
>> >
>> > When I tried to use the file system repository and the Elasticsearch 
>> > output, I noticed that the file is being encoded and stored in ES in the 
>> > _content property. What I'd rather do is have the file posted to ES as-is, 
>> > such as if the file is already a JSON document in the expected format for 
>> > my type mapping in ES. These files are 15k to 30k of nested object JSON.
>> >
>> > Use case 2: Read JSON from Alfresco, post it to Elasticsearch along with 
>> > object metadata
>> >
>> > In a slight twist on the first, I'd like to store JSON documents in a 
>> > repository, like Alfresco, and then read the metadata from the Alfresco 
>> > object and merge it with the JSON stored in the content and post that to 
>> > Elasticsearch as a JSON string, not as an encoded blob.
>> >
>> > I didn't see anything covering these in the docs but I may have missed it.
>> >
>> > Jeff
>> 
> 

Reply via email to