FYI, I wrote functionality that lets Lucene text analysis components be used to extract text features via a transformer in spark.ml pipelines. Non-machine-learning uses are supported too.
See my blog post describing the capabilities, which are included in the open-source spark-solr project: <https://lucidworks.com/blog/2016/04/13/spark-solr-lucenetextanalyzer/>

Feedback welcome!

--
Steve Rowe
www.lucidworks.com