Glad to be useful. Regards

----- Mensaje original -----
> De: "Dave Beckstrom" <dbeckst...@figleaf.com>
> Para: "user" <user@nutch.apache.org>
> Enviados: Martes, 5 de Marzo 2019 11:06:35
> Asunto: [MASSMAIL]JEXL and Exchanges

> Ryan and Roannel,
> 
> Thank you guys so much for your replies.  I didn't realize it but I was not
> seeing all of the emails from you.
> 
> Roannel you sent some really helpful replies that never came in as an
> email.  I found your replies when I browsed the web-based archives on the
> apache site.   I wanted to make sure I thanked you for your help!!!
> 
> I can't find one example of an exchanges.xml other than what ships with
> Nutch.   I'm really in the blind trying to get the exchanges to work.  I
> believe this may be the last item I need help with and then I'll have Nutch
> working the way I need it to.  Any help you can offer would be GREATLY
> appreciated.
> 
> Let's say I have a document that was crawled and the URL for the document
> was as follows:
> 
> http://www.somedomain.com/news/englishnews/2018/this-is-my-news-article.cfm
> 
> Here is the expression I have coded in exchanges.xml:
> 
> <param name="expr" value="doc.getFieldValue('url')=~'/englishnews/'" />
> 
> That expression is not triggering.  As near as I can tell the "=~" is the
> "contains" expression.  The idea being if the url contains "englishnews"
> then this expression should trigger.  I believe the slashes around
> "englishnews" makes it function as a regular expression, which should
> evaluate to true, rather then a string compare.
> 
> If anyone can help get me past this final road block I would greatly
> appreciate the help!  I spent an entire day on this yesterday and got
> nowhere.
> 
> Thank you!
> 
> Dave
> 
> --
> *Fig Leaf Software, Inc.*
> https://www.figleaf.com/
> <https://www.figleaf.com/>
> 
> Full-Service Solutions Integrator

Reply via email to