Christopher, It was weird synchronicity seeing this topic come up. I had just started looking at XML diff software and found the usual suspects: the IBM one (xmldiff), the SourceForge one (diffxml), a Python module (xmldiff) and the Microsoft one (xmldiffpatch).
My interest is in exposing changes in web services by providing a diff of the WSDL. I haven't gotten very far down the road. what I've learned: - naive diff is no good because hierarchical data can take many equivalent forms - There is no standard diff format, so those who implement it also usually write their own patch. - There are visual diffs, but I haven't gotten any of them to work. - The diff applications differ in complexity; the most complex will compare node names to attributes, etc., in trying to make a match. And to make this work, the app generally builds a DOM ... which limits the size of the documents it can compare. I don't know if that's helpful or not ... Here are links to the xml diffs I found: IBM: http://www.alphaworks.ibm.com/tech/xmldiffmerge SourceForge: http://diffxml.sourceforge.net/downloads.html Python: http://www.logilab.org/projects/xmldiff/ Microsoft: http://apps.gotdotnet.com/xmltools/xmldiff/ -- -- Ken Kress Information Technology LockwoodŽ 10 Valley Stream Parkway Malvern, PA 19355 Phone: (484) 605-4834 Fax: (484) 595-5700 Email: [EMAIL PROTECTED] Christopher R Newman <[EMAIL PROTECTED]> 04/04/2005 02:46 PM Please respond to general To: general@xml.apache.org cc: Subject: Re: XML differencing anyone? So noone has tried XML differencing, huh? Christopher R Newman wrote: I would probably prefer Perl since I actually know it. But I was wondering if anyone had any suggestions on specific programs/modules to use for XML differencing. Does anybody have experience with this? Chris jinwen lu wrote: Use Perl? I think Perl could be a good choice speed wise. jinwen --- Christopher R Newman <[EMAIL PROTECTED]> wrote: Curious if anyone has any experience with XML differencing, especially on an Apache server. I've been looking at the Perl modules like XML::SemanticDiff, XML::Diff, and XML::DifferenceMarkup, but they all seem to have some slight limitations. I just realized it might be might be possible for me to set up a Java servlet to run some of the Java-based XML differencing programs, but from what I've read so far many of them seem to be even worse off (slower, eat more memory, etc.). What would you all suggest I use and why? Any help would be greatly appreciated! Thanks in advance, Chris -- ====================================================================== Chris Newman Alcatel USA Work:(919)850-5586 Software Engineer 2301 Sugar Bush Rd. Fax:(919)850-6160 Test Automation Raleigh, NC 27601 [EMAIL PROTECTED] ====================================================================== ________________________________________________________________________ The information in this e-mail, and any attachment therein, is confidential and for use by the addressee only. If you are not the intended recipient, please return the e-mail to the sender and delete it from your computer. Although the Bank of New York attempts to sweep e-mail and attachments for viruses, it does not guarantee that either are virus-free and accepts no liability for any damage sustained as a result of viruses. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]