Hi Satish,

The connector will mirror anything in a SharePoint document Libary to 
MarkLogic.  A SharePoint document library could contain XML files, or the files 
could be text, but they're most likely Office documents.  

MarkLogic Server provides a feature we call the Content Processing Framework 
(CPF).  This framework allows us to create pipelines that will transform 
document content as soon as its saved to the Server.

If the mirrored documents are from Office 2007, the Server comes with the 
Office Open XML Extract pipeline, which will automatically unzip the documents 
and save the XML files they are composed of. Also, we provide the MarkLogic 
Toolkits for Word, Excel, and PowerPoint, which each come with pipelines and 
XQuery APIs for simplifying the use of Open XML within the Server.  

If the mirrored documents are Office 2003 flavor, the Server also comes with 
pipelines for transforming these formats to XHTML as well.

You can find more information about the Content Processing Framework and the 
Office Toolkits on our developer site.  But in a nutshell: It may start out as 
a Binary Object in SharePoint, but once it's in MarkLogic, we save it in the 
format we want for search, re-use, and delivery to multiple consumers.  The 
format you save as will be determined by your particular use-case and what 
solution you're ultimately trying to provide to your users/authors/etc.

http://developer.marklogic.com/

Hope this helps,
Pete


________________________________________
From: [email protected] 
[[email protected]] On Behalf Of Satish 
[[email protected]]
Sent: Tuesday, June 15, 2010 8:58 PM
To: [email protected]
Subject: [MarkLogic Dev General] Sharepoint connector for Marklogic

Hi,

I was interested in Sharepoint Connector for Marklogic, after reading through 
the document, I have couple of questions.
The document says the sharepoint library can be mirrored to Marklogic, so the 
documents are copied to Marklogic from Sharepoint.So I assume they are stored 
as BLOBs inside Marklogic and not converted to XML format.Is this correct? If 
so, How I can have custom search applications on this content which is in the 
form of BLOBs inside Marklogic?How do justify that the Marklogic search on this 
content is better than other search?

Any clarifications on these will be greatly helpful. Also if anyone has already 
implemented Sharepoint connector for Marklogic. request to share your 
experiences on the same.

Thanks,
satish.
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to