A little more information...the mime type in all these cases is null.
On Wed, Nov 20, 2013 at 12:26 PM, Mark Libucha <[email protected]> wrote: > I noticed this in the logs today...so it appears to be a problem with the > output connector, not with the SharePoint crawler. Sorry for the confusion. > > SharePoint: Skipping document '/test3/SitePages//greg data.aspx' because > output connector says mime type is not indexable > > Would be nice if the log message included the mime type. > > > On Tue, Nov 19, 2013 at 8:08 PM, Mark Libucha <[email protected]> wrote: > >> EnumerateFolder? >> >> http://msdn.microsoft.com/en-us/library/ms774758(v=office.12).aspx >> >> I really have no idea, and I'm not trying to add work to your plate. >> >> I'm just trying to understand what MCF does and does not crawl on a >> SharePoint site. I think you answered that. >> >> Thanks, >> >> Mark >> >> >> On Tue, Nov 19, 2013 at 5:32 PM, Karl Wright <[email protected]> wrote: >> >>> Hi Mark, >>> >>> The only content the current SharePoint connector crawls is: >>> - Sites and subsites >>> - Libraries, and files within >>> - Lists, and listitems, and attachments of listitems >>> >>> The connector does not currently know how to discover SitePages or crawl >>> them. If you would like this functionality, you will need to create a >>> ticket and hope that I have time to figure out how to do it using the >>> available web services. >>> >>> :-) >>> Karl >>> >>> >>> >>> >>> >>> On Tue, Nov 19, 2013 at 8:27 PM, Mark Libucha <[email protected]>wrote: >>> >>>> None of my crawls ever turn up SP "site pages". These are just normal >>>> .aspx web pages. The URLs look something like this: >>>> >>>> http://mydomain.com/mysite/SitePages/mark%20data.aspx >>>> >>>> I've tried tweaking the "Allowed file extensions", but can't get it to >>>> work. >>>> >>>> I'm using MCF 4.1. >>>> >>>> Any other suggestions? >>>> >>>> Thanks, >>>> >>>> Mark >>>> >>> >>> >> >
