Re: [CODE4LIB] Tool to highlight differences in two files
Hi Wilhelmina, you might want to take a look at the work of Myriam Ben Saad: http://www-poleia.lip6.fr/~bensaadm/. She has a few papers describing extending a vision-based page segmentation algorithm (VIPS) to support webpage change analysis (Vi-DIFF). Here's one such paper from 2010: http://www-poleia.lip6.fr/~bensaadm/DEXA2010.pdf. The algorithm detects both visual changes and changes in the DOM. Unfortunately, I haven't yet seen any links to tools or production code using the approach. ~Nicholas -- https://twitter.com/nullhandle
Re: [CODE4LIB] Tool to highlight differences in two files
If you are on OS X then BBEdit will do this for you. On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
[CODE4LIB] Tool to highlight differences in two files
I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
The one resource that came immediately to mind was Juxta: http://www.juxtasoftware.org/juxta-commons/ Thanks! Jim *Jim DelRosso, MPA, MSLIS Digital Projects Coordinator* *Hospitality, Labor, and Management Library* Catherwood Library ILR School Cornell University 239D Ives Hall Ithaca, NY 14853 p 607.255.8688 f 607.255.9641 e jd...@cornell.edu www.ilr.cornell.edu *Advancing the World of Work* On Tue, Apr 23, 2013 at 4:24 PM, Wilhelmina Randtke rand...@gmail.comwrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ It's not free or OSS, though. There's also a plugin for Notepad++ that does something similar: http://sourceforge.net/projects/npp-compare/ This is free, of course. Thanks! Allie -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Wilhelmina Randtke Sent: Tuesday, April 23, 2013 4:24 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Tool to highlight differences in two files I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
WinMerge is a great diff tool, and it is easy to use. On Tue, Apr 23, 2013 at 4:29 PM, Jim DelRosso jd...@cornell.edu wrote: The one resource that came immediately to mind was Juxta: http://www.juxtasoftware.org/juxta-commons/ Thanks! Jim *Jim DelRosso, MPA, MSLIS Digital Projects Coordinator* *Hospitality, Labor, and Management Library* Catherwood Library ILR School Cornell University 239D Ives Hall Ithaca, NY 14853 p 607.255.8688 f 607.255.9641 e jd...@cornell.edu www.ilr.cornell.edu *Advancing the World of Work* On Tue, Apr 23, 2013 at 4:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke -- Terry Brady Applications Programmer Analyst Lauinger Information Technology 202-687-7053
Re: [CODE4LIB] Tool to highlight differences in two files
Hi Wilhelmina, We've used oXygen and Text Wrangler (but only for macs). regards, Monica On 4/23/2013 3:24 PM, Wilhelmina Randtke wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
The absolute simplest way to do this would be to fire up a terminal (OSX/Linux) and: diff page1.html page2.html | less Unfortunately, this will also catch changes made in other markup, and may or may not be terribly readable. On Tue, Apr 23, 2013 at 4:31 PM, Alevtina Verbovetskaya alevtina.verbovetsk...@mail.cuny.edu wrote: I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ It's not free or OSS, though. There's also a plugin for Notepad++ that does something similar: http://sourceforge.net/projects/npp-compare/ This is free, of course. Thanks! Allie -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Wilhelmina Randtke Sent: Tuesday, April 23, 2013 4:24 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Tool to highlight differences in two files I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
On Tue, Apr 23, 2013 at 03:24:16PM -0500, Wilhelmina Randtke wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. diff --side-by-side file1 file2 ./fxk -Wilhelmina Randtke -- I went to the race track once and bet on a horse that was so good that it took seven others to beat him!
Re: [CODE4LIB] Tool to highlight differences in two files
VimDiff is quite nice. http://vimdoc.sourceforge.net/htmldoc/diff.html On Tue, Apr 23, 2013 at 4:43 PM, Francis Kayiwa kay...@uic.edu wrote: On Tue, Apr 23, 2013 at 03:24:16PM -0500, Wilhelmina Randtke wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. diff --side-by-side file1 file2 ./fxk -Wilhelmina Randtke -- I went to the race track once and bet on a horse that was so good that it took seven others to beat him! -- Sent from my GMail account.
Re: [CODE4LIB] Tool to highlight differences in two files
On Apr 23, 2013, at 4:37 PM, Alexander Duryee wrote: The absolute simplest way to do this would be to fire up a terminal (OSX/Linux) and: diff page1.html page2.html | less Unfortunately, this will also catch changes made in other markup, and may or may not be terribly readable. At the very least, I'd suggest adding a '-b' which will ignore changes to whitespace. Also see: http://www.w3.org/wiki/HtmlDiff -Joe On Tue, Apr 23, 2013 at 4:31 PM, Alevtina Verbovetskaya alevtina.verbovetsk...@mail.cuny.edu wrote: I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ It's not free or OSS, though. There's also a plugin for Notepad++ that does something similar: http://sourceforge.net/projects/npp-compare/ This is free, of course. Thanks! Allie -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Wilhelmina Randtke Sent: Tuesday, April 23, 2013 4:24 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Tool to highlight differences in two files I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
1. Create a gist (https://gist.github.com/{yourUserName}) with the original document. 2. Click edit and paste in the new document. 3. Click Revisions link. You'll see all the changes. On 4/23/13 4:24 PM, Wilhelmina Randtke wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
Same trick will work with google docs. Or just use the compare two versions of a document feature in Word... On Tue, Apr 23, 2013 at 2:14 PM, Shaun Ellis sha...@princeton.edu wrote: 1. Create a gist (https://gist.github.com/{**yourUserName}https://gist.github.com/%7ByourUserName%7D) with the original document. 2. Click edit and paste in the new document. 3. Click Revisions link. You'll see all the changes.
Re: [CODE4LIB] Tool to highlight differences in two files
Hi, I'm not sure if you're really looking for a diff tool, so I'll just shout an answer to a question that I think you might be asking. I use a variation of the script posted here: http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes for watching a web page for changes. I mostly only ever use this for watching for new artifacts to appear in Maven Central (because refreshing a web page is pretty dull work). Hope this helps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Do you love it? Do you hate it? There it is, the way you made it. --Frank Zappa On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
That helps a lot, because it's for websites which is what I want to compare. I am looking for changes in a site, and I have some archives, but tools for merging code are too labor intensive and don't give a good visual report that I can show to a supervisor. This is good moving forward, but doesn't cover historical pages. I was hoping for something where I could call up two pages and get a visual display of differences for the display version of html, not the code. -Wilhelmina On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I'm not sure if you're really looking for a diff tool, so I'll just shout an answer to a question that I think you might be asking. I use a variation of the script posted here: http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes for watching a web page for changes. I mostly only ever use this for watching for new artifacts to appear in Maven Central (because refreshing a web page is pretty dull work). Hope this helps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Do you love it? Do you hate it? There it is, the way you made it. --Frank Zappa On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
I'm a real newbie when it comes to such things, but what is the possibility of using AJAX to cycle through each version of a page? I'm guessing that dynamic elements in each of those pages might pose a problem (that I'll leave up to you coding experts to answer). If it's visuals you're really looking to get, it certainly would be a cool effect to have the pages fade in and out. And a timeline slider would be nice. John Lolis Information Technology Manager White Plains Public Library 100 Martine Avenue White Plains, NY 10601 email: jlo...@wppl.lib.ny.us tel: 1.914.422.1497 fax: 1.914.422.1452 http://whiteplainslibrary.org/ *¿* - On 4/23/2013 at 6:18 PM, in message CAJGGFoo0FOwAUnXSAP=pYOD0o6mOXWWLG7p=fmcfkd6mbqx...@mail.gmail.com, Wilhelmina Randtke rand...@gmail.com wrote: That helps a lot, because it's for websites which is what I want to compare. I am looking for changes in a site, and I have some archives, but tools for merging code are too labor intensive and don't give a good visual report that I can show to a supervisor. This is good moving forward, but doesn't cover historical pages. I was hoping for something where I could call up two pages and get a visual display of differences for the display version of html, not the code. -Wilhelmina On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I'm not sure if you're really looking for a diff tool, so I'll just shout an answer to a question that I think you might be asking. I use a variation of the script posted here: http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes for watching a web page for changes. I mostly only ever use this for watching for new artifacts to appear in Maven Central (because refreshing a web page is pretty dull work). Hope this helps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Do you love it? Do you hate it? There it is, the way you made it. --Frank Zappa On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
Automating your favourite browser to load and screenshot each version and then using http://www.imagemagick.org/Usage/compare/ should work. Note that this will also catch the scenario where someone has changed the page by changing an image on the page. cheers stuart On 24/04/13 10:18, Wilhelmina Randtke wrote: That helps a lot, because it's for websites which is what I want to compare. I am looking for changes in a site, and I have some archives, but tools for merging code are too labor intensive and don't give a good visual report that I can show to a supervisor. This is good moving forward, but doesn't cover historical pages. I was hoping for something where I could call up two pages and get a visual display of differences for the display version of html, not the code. -Wilhelmina On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I'm not sure if you're really looking for a diff tool, so I'll just shout an answer to a question that I think you might be asking. I use a variation of the script posted here: http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes for watching a web page for changes. I mostly only ever use this for watching for new artifacts to appear in Maven Central (because refreshing a web page is pretty dull work). Hope this helps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Do you love it? Do you hate it? There it is, the way you made it. --Frank Zappa On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke -- Stuart Yeates Library Technology Services http://www.victoria.ac.nz/library/
Re: [CODE4LIB] Tool to highlight differences in two files
There's a list here that may be more along the lines of what you're seeking. http://webapps.stackexchange.com/questions/11547/diff-for-websites Genny Engel Sonoma County Library gen...@sonoma.lib.ca.us 707 545-0831 x1581 www.sonomalibrary.org -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Wilhelmina Randtke Sent: Tuesday, April 23, 2013 3:19 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Tool to highlight differences in two files That helps a lot, because it's for websites which is what I want to compare. I am looking for changes in a site, and I have some archives, but tools for merging code are too labor intensive and don't give a good visual report that I can show to a supervisor. This is good moving forward, but doesn't cover historical pages. I was hoping for something where I could call up two pages and get a visual display of differences for the display version of html, not the code. -Wilhelmina On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I'm not sure if you're really looking for a diff tool, so I'll just shout an answer to a question that I think you might be asking. I use a variation of the script posted here: http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes for watching a web page for changes. I mostly only ever use this for watching for new artifacts to appear in Maven Central (because refreshing a web page is pretty dull work). Hope this helps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Do you love it? Do you hate it? There it is, the way you made it. --Frank Zappa On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote: I would like to compare versions of a website scraped at different times to see what paragraphs on a page have changed. Does anyone here know of a tool for holding two files side by side and noting what is the same and what is different between the files? It seems like any simple script to note differences in two strings of text would work, but I don't know a tool to use. -Wilhelmina Randtke
Re: [CODE4LIB] Tool to highlight differences in two files
On Apr 23, 2013, at 8:12 PM, Genny Engel wrote: There's a list here that may be more along the lines of what you're seeking. http://webapps.stackexchange.com/questions/11547/diff-for-websites Hmm ... I guess I should actually accept the answer as it was the only one ever given. -Joe