I noticed this in the logs today, so it appears to be a problem with the output connector, not with the SharePoint crawler. Sorry for the confusion.
SharePoint: Skipping document '/test3/SitePages//greg data.aspx' because output connector says mime type is not indexable

Would be nice if the log message included the mime type.


On Tue, Nov 19, 2013 at 8:08 PM, Mark Libucha <[email protected]> wrote:

> EnumerateFolder?
>
> http://msdn.microsoft.com/en-us/library/ms774758(v=office.12).aspx
>
> I really have no idea, and I'm not trying to add work to your plate.
>
> I'm just trying to understand what MCF does and does not crawl on a
> SharePoint site. I think you answered that.
>
> Thanks,
>
> Mark
>
>
> On Tue, Nov 19, 2013 at 5:32 PM, Karl Wright <[email protected]> wrote:
>
>> Hi Mark,
>>
>> The only content the current SharePoint connector crawls is:
>> - Sites and subsites
>> - Libraries, and files within
>> - Lists, and listitems, and attachments of listitems
>>
>> The connector does not currently know how to discover SitePages or crawl
>> them. If you would like this functionality, you will need to create a
>> ticket and hope that I have time to figure out how to do it using the
>> available web services.
>>
>> :-)
>> Karl
>>
>>
>> On Tue, Nov 19, 2013 at 8:27 PM, Mark Libucha <[email protected]> wrote:
>>
>>> None of my crawls ever turn up SP "site pages". These are just normal
>>> .aspx web pages. The URLs look something like this:
>>>
>>> http://mydomain.com/mysite/SitePages/mark%20data.aspx
>>>
>>> I've tried tweaking the "Allowed file extensions", but can't get it to
>>> work.
>>>
>>> I'm using MCF 4.1.
>>>
>>> Any other suggestions?
>>>
>>> Thanks,
>>>
>>> Mark
>>>
>>
>
