Thank you Karl.

For the record: To achieve my demand I had to create an "Allowed documents" Transformation connection and insert that connection into a job pipeline.
Hi Igor,

The documentation for the JCIFS connector is pretty
good: 
https://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#jcifsrepository
[1]

It describes how to construct matches for the Paths tab, using * and ?
as wildcards.  They are not regular expressions.

Karl

On Mon, Nov 30, 2015 at 7:25 PM, <[email protected]> wrote:

How can I write an expression to match only certain file types to
index?
For example, I just want to index .pdf .doc and .odt files

Creating a crawl Job I'd tried to insert a rule "Include indexable
file(s) matching", in "Paths" filters, with this regular expression
/(?i:.(pdf|doc|odt))$ but everything was indexed, seems the
expression was not validated, the same with /.(pdf|doc|odt)$/i and
".(pdf|doc|odt)$"i
What is the proper way to write filters? Some example?

Thanks!
Igor Kattar



Links:
------
[1]
https://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#jcifsrepository

Reply via email to