I am currently cleansing a considerable amount of textual data that is 
already loaded into Neo4j. Since Cypher string functions are very limited 
(for the time being), I developed a minimalist Neo4j server plugin that 
lets me run my regex-based string transformations server-side. This 
functionality may already exist in a library that I am not aware of; if 
so, I would be glad if you pointed it out. If not, you are welcome to use 
this plugin whenever you need to do some text-crunching inside the graph 
db.

Download the plugin here:

https://github.com/mszargar/Regx4Neo/releases

And the readme file is here:

https://github.com/mszargar/Regx4Neo/blob/master/README.md

The plugin filters nodes by a given label, runs a regex against the 
content of a given property using Java's native regex methods, and 
transforms the string based on the pattern matches. Currently it covers 
the basics: it can substitute each match with another string, split the 
property content at the matches, or simply extract the matches. The first 
operation yields one new string per node processed, while the last two 
yield an array of strings per node processed. The plugin can either return 
the results or save them in another property for further processing.
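
To give a rough idea of the three operations, here is a minimal sketch 
built on java.util.regex. It is not the plugin's actual code: the class 
name RegexOps and the method names are made up for illustration, and it 
works on plain strings rather than on node properties.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; the names are illustrative, not the plugin's API.
final class RegexOps {

    // Substitute every match of the regex with the replacement string.
    static String substitute(String text, String regex, String replacement) {
        return Pattern.compile(regex).matcher(text).replaceAll(replacement);
    }

    // Split the text at every match of the regex.
    static String[] split(String text, String regex) {
        return Pattern.compile(regex).split(text);
    }

    // Collect every substring that matches the regex, in order of appearance.
    static String[] matches(String text, String regex) {
        Matcher matcher = Pattern.compile(regex).matcher(text);
        List<String> found = new ArrayList<>();
        while (matcher.find()) {
            found.add(matcher.group());
        }
        return found.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String text = "a1b22c";
        System.out.println(substitute(text, "\\d+", "#"));           // a#b#c
        System.out.println(Arrays.toString(split(text, "\\d+")));    // [a, b, c]
        System.out.println(Arrays.toString(matches(text, "\\d+")));  // [1, 22]
    }
}
```

As in the description above, the first method gives back a single string, 
while the other two give back an array of strings.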

I hope this saves someone some time!

Cheers,
Mah
