I am currently cleansing a considerable amount of textual data that is already loaded into Neo4j. Since Cypher's string functions are very limited (for the time being), I developed a minimalist Neo4j server plugin that lets me run my regex-based string transformations server-side. This functionality may already exist in some library that I am not aware of, in which case I would be glad if you pointed it out. If not, you are welcome to use this plugin if you need to do some text-crunching inside the graph db.
Download the plugin here: https://github.com/mszargar/Regx4Neo/releases
The readme file is here: https://github.com/mszargar/Regx4Neo/blob/master/README.md

The plugin filters nodes by a specific label, runs a regex against the content of a specific property using Java's native regex methods, and then transforms the string based on the pattern matches. Currently it covers the basics: it can substitute matching instances with another string, split the property content at the matching instances, or simply return the pattern matches themselves. The first operation yields a new string for each node processed, while the last two yield an array of strings per node processed. The plugin can either return the results or save them in another property for further processing.

I hope this saves someone some time!

Cheers,
Mah
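
For reference, the three operations described above (substitute, split, and match) correspond roughly to standard java.util.regex calls. The following is a standalone sketch only, not the plugin's actual code: the class name, method names, and sample data are hypothetical, and it works on plain strings rather than on Neo4j nodes or properties.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration of the three regex operations on a plain string.
final class RegexOpsSketch {

    // Substitute: replace every match of the pattern with another string.
    static String substitute(String input, String regex, String replacement) {
        return Pattern.compile(regex).matcher(input).replaceAll(replacement);
    }

    // Split: cut the input at each match, yielding an array of strings.
    static String[] split(String input, String regex) {
        return Pattern.compile(regex).split(input);
    }

    // Match: collect every substring that matches the pattern.
    static String[] matches(String input, String regex) {
        List<String> found = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(input);
        while (m.find()) {
            found.add(m.group());
        }
        return found.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String text = "tel: 555-1234, fax: 555-9876"; // made-up sample value
        String number = "\\d{3}-\\d{4}";

        System.out.println(substitute(text, number, "<redacted>"));
        System.out.println(Arrays.toString(split(text, ",\\s*")));
        System.out.println(Arrays.toString(matches(text, number)));
    }
}
```

As in the description above, the first operation produces a single string per input, while the other two produce an array of strings per input.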
