Hi Lalit, Have you added any metadata rules on the job's Metadata tab?
See http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#sharepointrepository . Karl On Thu, May 29, 2014 at 12:32 PM, lalit jangra <[email protected]> wrote: > Thanks Karl, > > With your help, i am able to content indexed in my solr with logs as below > with some meaningful value to literal.allow_token_document variable. But > now i am struggling with not able to get any property indexed from > sharepoint to solr. On MCF job page, i have put sharepoint content > properties such as Name, Title, GUID etc. which are mapped to fields in > my solr schema but i am able to see only GUID property filled with metadata > & not any other. > > Can you help here? > > content_id={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.id= > http://testirishwaterportal/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2} > {add=[http://testirishwaterportal/sites/hr/Documents/A2.docx > (1469453279104598016)]} 0 64 > > INFO - 2014-05-29 17:10:51.533; > org.apache.solr.update.processor.LogUpdateProcessor; [collection1] > webapp=/solr1 path=/update/extract > params={literal.deny_token_document=Agrp:DEAD_AUTHORITY&literal.content_id={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}& > literal.id= > http://testirishwaterportal/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2} > {add=[http://testirishwaterportal/sites/hr/Documents/Test%2011111.docx > (1469453279196872704)]} 0 63 > > > > On Thu, May 29, 2014 at 12:28 PM, Karl Wright <[email protected]> wrote: > >> Hi Lalit, >> >> deny_token_document being set to DEAD_AUTHORITY seems to imply you have >> selected "active directory" as the authorization type for you connection. >> (This is done on the Authority Type tab.) But it may be the case that you >> are using SharePoint in Claims-based mode. If that's true, you should have: >> >> - Select "Native" as the authority type >> - Set up a SharePoint/Native authority >> - If you have AD involved, also set up a SharePoint/ActiveDirectory >> authority. >> >> FWIW, the two metadata values you want to watcht in the Solr URL are: >> >> literal.deny_token_document=DEAD_AUTHORITY >> literal.allow_token_document= >> >> You should see something in the allow field if your configuration is >> right, and the SharePoint document is visible to anyone at all. >> >> You will also need to add appropriate fields in Solr for security tokens, >> but I imagine you've already done that. >> Thanks, >> Karl >> >> >> >> On Thu, May 29, 2014 at 6:43 AM, lalit jangra <[email protected]> >> wrote: >> >>> Hi, >>> >>> I have configured a job to crawl sharepoint with Apache MCF & storing >>> index in solr.I run the job, it works fine without any error on screen & >>> MCF logs and completes elegantly. I have a custom solr schema that works >>> for alfresco fine. >>> >>> Now when i go back to solr admin screen to query added documents, i am >>> not able to see any sharepoint docs at all & see some deny_token details in >>> solr logs. I also have tried to map/unmap properties from sharepoint to >>> solr in MCf job screen but of no avail. >>> >>> Can anyone help me here? >>> >>> INFO - 2014-05-29 10:34:18.100; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2} >>> {commit=} 0 95 >>> INFO - 2014-05-29 10:41:16.555; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D3&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[ >>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=3 >>> (1469428768758038528)]} 0 3 >>> INFO - 2014-05-29 10:41:16.576; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={179F2C90-14A0-4097-A9F4-C0D2CD9D65B1}&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D2&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[ >>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=2 >>> (1469428768771670016)]} 0 11 >>> INFO - 2014-05-29 10:41:17.004; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}:IrishWater_-_ECM_-_High_Availability_Design.docx&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[ >>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx >>> (1469428769214169088)]} 0 199 >>> INFO - 2014-05-29 10:41:17.343; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={39D4D9C1-301B-4082-94D7-818323509ABC}&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/Shared%2520Documents/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[ >>> http://sharepontsite/sites/hr/Shared%20Documents/IrishWater_-_ECM_-_High_Availability_Design.docx >>> (1469428769581170688)]} 0 171 >>> INFO - 2014-05-29 10:41:31.555; >>> org.apache.solr.update.DirectUpdateHandler2; start >>> commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false} >>> INFO - 2014-05-29 10:41:31.662; >>> org.apache.solr.core.SolrDeletionPolicy; SolrDeletionPolicy.onCommit: >>> commits: num=2 >>> >>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index >>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490; >>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_9,generation=9} >>> >>> >>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index >>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490; >>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10} >>> INFO - 2014-05-29 10:41:31.663; >>> org.apache.solr.core.SolrDeletionPolicy; newest commit generation = 10 >>> INFO - 2014-05-29 10:41:31.669; >>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@2cf5f14b >>> realtime >>> INFO - 2014-05-29 10:41:31.670; >>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush >>> INFO - 2014-05-29 10:41:49.886; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[http://sharepontsite/sites/hr/Documents/Test%2011111.docx >>> (1469428803708125184)]} 0 55 >>> INFO - 2014-05-29 10:41:49.982; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract >>> params={literal.GUID={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.deny_token_document=DEAD_AUTHORITY& >>> literal.id= >>> http://sharepontsite/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=&wt=xml&version=2.2} >>> {add=[http://sharepontsite/sites/hr/Documents/A2.docx >>> (1469428803806691328)]} 0 49 >>> INFO - 2014-05-29 10:41:58.249; >>> org.apache.solr.update.DirectUpdateHandler2; start >>> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false} >>> INFO - 2014-05-29 10:41:58.424; >>> org.apache.solr.core.SolrDeletionPolicy; SolrDeletionPolicy.onCommit: >>> commits: num=2 >>> >>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index >>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490; >>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10} >>> >>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index >>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490; >>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_b,generation=11} >>> INFO - 2014-05-29 10:41:58.425; >>> org.apache.solr.core.SolrDeletionPolicy; newest commit generation = 11 >>> INFO - 2014-05-29 10:41:58.433; >>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@4f1ea922 main >>> INFO - 2014-05-29 10:41:58.435; >>> org.apache.solr.core.QuerySenderListener; QuerySenderListener sending >>> requests to Searcher@4f1ea922 >>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1 >>> _d(4.6):C1)} >>> INFO - 2014-05-29 10:41:58.436; >>> org.apache.solr.core.QuerySenderListener; QuerySenderListener done. >>> INFO - 2014-05-29 10:41:58.437; >>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush >>> INFO - 2014-05-29 10:41:58.441; org.apache.solr.core.SolrCore; >>> [collection1] Registered new searcher Searcher@4f1ea922 >>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1 >>> _d(4.6):C1)} >>> INFO - 2014-05-29 10:41:58.444; >>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1] >>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2} >>> {commit=} 0 195 >>> >>> >>> >>> -- >>> Regards, >>> Lalit Jangra. >>> >> >> > > > -- > Regards, > Lalit Jangra. >
