-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Michael,
Trying again in case I got the wrong file. Downloading: http://dist.neo4j.org/jexp/shell/neo4j-shell-tools-2.0.zip Perhaps I did lower case a instead of A on the replace question? Continuing. Restarted server, neo4j-shell, exporting.... OK, running SGML validation shows only the upper ASCII characters as errors. Thanks! I must have messed up the overwrite of the file in lib. When prompted. Hope you are having a great week! Patrick PS: Tomorrow, late for me here, will run HTML Tidy (c version) to see if it cleans out the upper ASCII stuff. On 01/21/2014 06:58 PM, Michael Hunger wrote: > Really that's weird, b/c I changed the label format to > :labe1l:label2:label3 But it still shows the comma separated one in > your case. > > Perhaps it was not up to date? Where did you download it? > > Michael > > My current export looks like this: > > <?xml version="1.0" encoding="UTF-8"?> <graphml > xmlns="http://graphml.graphdrawing.org/xmlns" > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > xsi:schemaLocation="http://graphml.graphdrawing.org/xmlns > http://graphml.graphdrawing.org/xmlns/1.0/graphml.xsd"> <graph > id="G" edgedefault="directed"> <key id="na<>me" for="node" > attr.name="na<>me" attr.type="string"/> <key id="count" > for="edge" attr.name="count" attr.type="int"/> <node id="n0" > labels=":FOO"><data key="labels">:FOO</data><data > key="na<>me">John & Dö</data></node> <edge id="e0" > source="n0" target="n0" label="BAR"><data > key="label">BAR</data><data key="count">0</data></edge> </graph> > </graphml> > > Am 21.01.2014 um 20:06 schrieb Patrick Durusau > <[email protected]>: > > Michael, > > I just downloaded the latest neo4j-shell-tools and an export got: > > <node id="n1" labels="User,expertValidation,SeedNode" ><data > key="labels">User,expertValidation,SeedNode</data><data > key="id_str">269740110</data><data key="name">Andreyana > Ivanova</data><data key="screen_name">adiivanova</data><data > key="description">Passionate and inspiring Equality & Diversity > Practitioner</data><data key="followers_count">35</data><data > key="friends_count">52</data><data > key="listed_count">2</data><data > key="statuses_count">11</data><data > key="favourites_count">0</data><data > key="location">London</data><data > key="time_zone">London</data><data key="utc_offset">0</data><data > key="lang">en</data><data > key="profile_image_url">http://pbs.twimg.com/profile_images/1281406234/IMG_5547_normal.JPG</data><data > > key="geo_enabled">false</data><data key="verified">false</data><data > key="notifications">false</data></node> > > Between "Equality & Divesity" is an example of a & that should be > written as: &" > > Sorry for not checking my email earlier but I wanted to create a > file with several examples (I have inserted XML comments for each > example) of what markup errors I am picking up. > > BTW, there is a edge case where < and > should not be converted > but that is in XML processing instructions and it is unlikely > anyone will be encountering those in an graph database. As soon as > I hit "send" an example case will hit the email list. ;-) > > It has been a while since I have looked at conversion filter > libraries but I suspect there is something that would correct the > character and markup errors automatically on export. HTML tidy I > think has that capacity. > > Anyway file attached. > > Hope you are having a great week! > > Patrick > > On 01/20/2014 10:33 PM, Michael Hunger wrote: >>>> Patrick, >>>> >>>> the xml encoding issues for <> & etc. should be addressed. >>>> >>>> Not sure how do deal with the control characters though. The >>>> only thing I could think of is to write data as CDATA >>>> fields? >>>> >>>> Or strip them somehow upfront. >>>> >>>> Michael >>>> >>>> Am 21.01.2014 um 01:34 schrieb Patrick Durusau >>>> <[email protected]>: >>>> >>>> Michael, >>>> >>>> On 01/20/2014 02:48 AM, Michael Hunger wrote: >>>>>>> Thanks for the feedback, will fix these issues. >>>>>>> >>>>>>> Do you know where the control characters came from? >>>>>>> >>>> >>>> Guessing I would say that Twitter accepts pasted content. >>>> Works ok as long as you are in the lower ASCII set but for >>>> things like trademark (tm) and the R with a circle? Sorry, >>>> I'm real tired. >>>> >>>> I still have an uncorrected version of the data and will try >>>> to fish out the lines in question. The full file is large in >>>> email terms and most of it would not be helpful. >>>> >>>> I'll get some sleep and look at it in the morning. >>>> >>>> Will verify the problems still exist in Gephi as well with >>>> the much smaller version of the file. >>>> >>>> Hope you are having a great day! >>>> >>>> Patrick >>>> >>>> >>>>>>> Michael >>>>>>> >>>>>>> Am 20.01.2014 um 02:59 schrieb Patrick Durusau >>>>>>> <[email protected]>: >>>>>>> >>>>>>> Michael, >>>>>>> >>>>>>> I tried out the export to GraphML today. >>>>>>> >>>>>>> I was using data from a Twitter feed. >>>>>>> >>>>>>> The first issue on trying to load into GraphML was that >>>>>>> the "&" character was not written "&" >>>>>>> >>>>>>> When converting files to XML, escape "&" with "&", >>>>>>> "<" as < and ">" as > >>>>>>> >>>>>>> The next several issues were control characters ^B, ^C, >>>>>>> etc. embedded before TM and R, etc. >>>>>>> >>>>>>> Conversion to UTF-8 and stripping anything that >>>>>>> doesn't convert would be nice. >>>>>>> >>>>>>> The parser in my Emacs must not match what is being >>>>>>> used in Gephi because it would choke even though Emacs >>>>>>> said all was well. >>>>>>> >>>>>>> Hope you are at the start of a great week! >>>>>>> >>>>>>> Patrick >>>>>>> >>>>>>> On 01/17/2014 07:49 PM, Michael Hunger wrote: >>>>>>>>>> +1 that would be awesome >>>>>>>>>> >>>>>>>>>> I wanted to give it a try myself but haven't >>>>>>>>>> found the time. >>>>>>>>>> >>>>>>>>>> Btw. my neo4j-shell-tools now export Neo4j to >>>>>>>>>> GraphML, so you can visualize your db in Gephi, >>>>>>>>>> would love some feedback: >>>>>>>>>> >>>>>>>>>> https://github.com/jexp/neo4j-shell-tools/tree/20#graphml-export >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>> >>>>>>>>>> > >>>>>>>>>> Michael >>>>>>>>>> >>>>>>>>>> Am 18.01.2014 um 00:29 schrieb Marcelo Gagliano >>>>>>>>>> <[email protected] >>>>>>>>>> <mailto:[email protected]>>: >>>>>>>>>> >>>>>>>>>>> Hi, Caleb. >>>>>>>>>>> >>>>>>>>>>> Did you developed that client? If so, could you >>>>>>>>>>> share the source code? I am trying to create a >>>>>>>>>>> similar solution, but I am not having much >>>>>>>>>>> success. >>>>>>>>>>> >>>>>>>>>>> Thank you, Marcelo Gagliano >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Friday, June 21, 2013 3:12:20 AM UTC-3, >>>>>>>>>>> Caleb Jones wrote: >>>>>>>>>>> >>>>>>>>>>> I'm currently working on building a Java client >>>>>>>>>>> for the Gephi streaming API and will be >>>>>>>>>>> presenting at the Seattle Graph Meetup group. >>>>>>>>>>> I'm aware of the Neo4j Gephi plugin >>>>>>>>>>> (https://marketplace.gephi.org/plugin/neo4j-graph-database-support/ >>>>>>>>>>> >>>>>>>>>>> >>>> >>>>>>>>>>> > >>>>>>>>>>> <https://marketplace.gephi.org/plugin/neo4j-graph-database-support/>) >>>>>>>>>>> >>>>>>>>>>> >>>>>>> and plan on showing how that can be used too, but I'm >>>>>>> curious if >>>>>>>>>>> there are any specific Neo4j applications that >>>>>>>>>>> would fit well with the streaming work I'm >>>>>>>>>>> doing. >>>>>>>>>>> >>>>>>>>>>> One thought is to have a mode in the streaming >>>>>>>>>>> client I'm writing that tees the streaming to >>>>>>>>>>> both Gephi and Neo4j. Of course, someone could >>>>>>>>>>> just stream to Gephi then export to Neo4j as >>>>>>>>>>> well. >>>>>>>>>>> >>>>>>>>>>> I'm not drowning in free time to do this, so >>>>>>>>>>> I'm looking for simple integrations to do. >>>>>>>>>>> >>>>>>>>>>> Thoughts? >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> -- You received this message because you are >>>>>>>>>>> subscribed to the Google Groups "Neo4j" group. >>>>>>>>>>> To unsubscribe from this group and stop >>>>>>>>>>> receiving emails from it, send an email to >>>>>>>>>>> [email protected] >>>>>>>>>>> <mailto:[email protected]>. >>>>>>>>>>> For more options, visit >>>>>>>>>>> https://groups.google.com/groups/opt_out. >>>>>>>>>> >>>>>>>>>> -- You received this message because you are >>>>>>>>>> subscribed to the Google Groups "Neo4j" group. To >>>>>>>>>> unsubscribe from this group and stop receiving >>>>>>>>>> emails from it, send an email to >>>>>>>>>> [email protected]. For more >>>>>>>>>> options, visit >>>>>>>>>> https://groups.google.com/groups/opt_out. >>>>>>> >>>>>>>> >>>>>>>> -- You received this message because you are >>>>>>>> subscribed to the Google Groups "Neo4j" group. To >>>>>>>> unsubscribe from this group and stop receiving emails >>>>>>>> from it, send an email to >>>>>>>> [email protected]. For more >>>>>>>> options, visit >>>>>>>> https://groups.google.com/groups/opt_out. >>>>>>> >>>> >>>>> >>>>> -- You received this message because you are subscribed to >>>>> the Google Groups "Neo4j" group. To unsubscribe from this >>>>> group and stop receiving emails from it, send an email to >>>>> [email protected]. For more options, >>>>> visit https://groups.google.com/groups/opt_out. >>>> > >> >> -- You received this message because you are subscribed to the >> Google Groups "Neo4j" group. To unsubscribe from this group and >> stop receiving emails from it, send an email to >> [email protected]. For more options, visit >> https://groups.google.com/groups/opt_out. <graphml-test.txt> > - -- Patrick Durusau [email protected] Technical Advisory Board, OASIS (TAB) Co-Chair, OpenDocument Format TC (OASIS) Editor, OpenDocument Format TC, Project Editor ISO/IEC 26300 Former Chair, V1 - US TAG to JTC 1/SC 34 Convener, JTC 1/SC 34/WG 3 (Topic Maps) Co-Editor, ISO 13250-5 (Topic Maps) Another Word For It (blog): http://tm.durusau.net Homepage: http://www.durusau.net Twitter: patrickDurusau -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQIcBAEBAgAGBQJS3yn2AAoJEAudyeI2QFGo2iUP/17bqYLj3COK1H3kTyeVH1tA bL0oSewBbc8mtpA/DX3ziPBGMWSkt79o8OXwvpbDQR34VkmjpxzPin7n3kw9sqEg 906zNFCAauMcZwLS4zdVOLxW677g7j41bfh5E5h8JTCau0ktCjNZcrkp15C5aYLt cCEW9OlI418UwQLO+ixKtu5REx4hcWPtNfsPyCYn4xFqGqeZJK/nFuOV7dEK2Doo lz3TxOpaHOLvQyEn1jc5iPJmRyzaqetV1x8NNCmdNt18helKZTyICy7obdt4bDsA pb+2X1pkipu3Vj7pkYZj+EBG7pAFPJ+g2zkvye3+WFsOsXGmbaSu1AHVimUIOzMI 3H1zKihy/0jSCrsK4Fa0MkdI7uQOpQ93sRCK4oy8MdLEmzGwjp+g6kURWEnX7a4/ lACs3iqUBqlT4lDF0HVN5MsbFmOBSDfQpItdY9pJUOx6mIyYrcKGEaw7S1Kg6/g9 GeVVOXi8QZRiLtBiY7eS8D3N3/fNC2kQ65aKzN5cgtpaWY/joTUEseXnGxskEway Y7AbExuJ60geUoczOgDRLpkWM0fXmpuksCMkA25hgJnmQLNNI8ASluJA7WIR0dyE VvorWogWG/Nl94vjcUecoOat/kTEPVqj+guOTsLWCddQF8pv4kPsxv3xHeuj0fi1 AjqexDp+SHy2rNlosMXy =OOoq -----END PGP SIGNATURE----- -- You received this message because you are subscribed to the Google Groups "Neo4j" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.
