Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-24 Thread Taylor, Nicholas A.
Hi Wilhelmina, you might want to take a look at the work of Myriam Ben Saad: 
http://www-poleia.lip6.fr/~bensaadm/. She has a few papers describing extending 
a vision-based page segmentation algorithm (VIPS) to support webpage change 
analysis (Vi-DIFF). Here's one such paper from 2010: 
http://www-poleia.lip6.fr/~bensaadm/DEXA2010.pdf. The algorithm detects both 
visual changes and changes in the DOM.

Unfortunately, I haven't yet seen any links to tools or production code using 
the approach.

~Nicholas

--
https://twitter.com/nullhandle


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-24 Thread Rogers, Nathan
If you are on OS X then BBEdit will do this for you.

On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:

I would like to compare versions of a website scraped at different times
to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke


[CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Wilhelmina Randtke
I would like to compare versions of a website scraped at different times to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Jim DelRosso
The one resource that came immediately to mind was Juxta:
http://www.juxtasoftware.org/juxta-commons/

Thanks!

Jim

*Jim DelRosso, MPA, MSLIS
Digital Projects Coordinator*
*Hospitality, Labor, and Management Library*
Catherwood Library
ILR School
Cornell University
239D Ives Hall
Ithaca, NY 14853
p 607.255.8688
f 607.255.9641
e jd...@cornell.edu
www.ilr.cornell.edu
*Advancing the World of Work*


On Tue, Apr 23, 2013 at 4:24 PM, Wilhelmina Randtke rand...@gmail.comwrote:

 I would like to compare versions of a website scraped at different times to
 see what paragraphs on a page have changed.  Does anyone here know of a
 tool for holding two files side by side and noting what is the same and
 what is different between the files?

 It seems like any simple script to note differences in two strings of text
 would work, but I don't know a tool to use.

 -Wilhelmina Randtke



Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Alevtina Verbovetskaya
I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ It's 
not free or OSS, though.

There's also a plugin for Notepad++ that does something similar: 
http://sourceforge.net/projects/npp-compare/ This is free, of course.

Thanks!
Allie


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of 
Wilhelmina Randtke
Sent: Tuesday, April 23, 2013 4:24 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] Tool to highlight differences in two files

I would like to compare versions of a website scraped at different times to see 
what paragraphs on a page have changed.  Does anyone here know of a tool for 
holding two files side by side and noting what is the same and what is 
different between the files?

It seems like any simple script to note differences in two strings of text 
would work, but I don't know a tool to use.

-Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Terry Brady
WinMerge is a great diff tool, and it is easy to use.


On Tue, Apr 23, 2013 at 4:29 PM, Jim DelRosso jd...@cornell.edu wrote:

 The one resource that came immediately to mind was Juxta:
 http://www.juxtasoftware.org/juxta-commons/

 Thanks!

 Jim

 *Jim DelRosso, MPA, MSLIS
 Digital Projects Coordinator*
 *Hospitality, Labor, and Management Library*
 Catherwood Library
 ILR School
 Cornell University
 239D Ives Hall
 Ithaca, NY 14853
 p 607.255.8688
 f 607.255.9641
 e jd...@cornell.edu
 www.ilr.cornell.edu
 *Advancing the World of Work*


 On Tue, Apr 23, 2013 at 4:24 PM, Wilhelmina Randtke rand...@gmail.com
 wrote:

  I would like to compare versions of a website scraped at different times
 to
  see what paragraphs on a page have changed.  Does anyone here know of a
  tool for holding two files side by side and noting what is the same and
  what is different between the files?
 
  It seems like any simple script to note differences in two strings of
 text
  would work, but I don't know a tool to use.
 
  -Wilhelmina Randtke
 




-- 
Terry Brady
Applications Programmer Analyst
Lauinger Information Technology
202-687-7053


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Monica Rivero

Hi Wilhelmina,

We've used oXygen and Text Wrangler (but only for macs).

regards,
Monica

On 4/23/2013 3:24 PM, Wilhelmina Randtke wrote:

I would like to compare versions of a website scraped at different times to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke



Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Alexander Duryee
The absolute simplest way to do this would be to fire up a terminal
(OSX/Linux) and:

diff page1.html page2.html | less

Unfortunately, this will also catch changes made in other markup, and
may or may not be terribly readable.

On Tue, Apr 23, 2013 at 4:31 PM, Alevtina Verbovetskaya
alevtina.verbovetsk...@mail.cuny.edu wrote:
 I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ 
 It's not free or OSS, though.

 There's also a plugin for Notepad++ that does something similar: 
 http://sourceforge.net/projects/npp-compare/ This is free, of course.

 Thanks!
 Allie


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of 
 Wilhelmina Randtke
 Sent: Tuesday, April 23, 2013 4:24 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: [CODE4LIB] Tool to highlight differences in two files

 I would like to compare versions of a website scraped at different times to 
 see what paragraphs on a page have changed.  Does anyone here know of a tool 
 for holding two files side by side and noting what is the same and what is 
 different between the files?

 It seems like any simple script to note differences in two strings of text 
 would work, but I don't know a tool to use.

 -Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Francis Kayiwa
On Tue, Apr 23, 2013 at 03:24:16PM -0500, Wilhelmina Randtke wrote:
 I would like to compare versions of a website scraped at different times to
 see what paragraphs on a page have changed.  Does anyone here know of a
 tool for holding two files side by side and noting what is the same and
 what is different between the files?
 
 It seems like any simple script to note differences in two strings of text
 would work, but I don't know a tool to use.

diff --side-by-side file1 file2

./fxk

 
 -Wilhelmina Randtke
 

-- 
I went to the race track once and bet on a horse that was so good that
it took seven others to beat him!


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Devon
VimDiff is quite nice.

http://vimdoc.sourceforge.net/htmldoc/diff.html



On Tue, Apr 23, 2013 at 4:43 PM, Francis Kayiwa kay...@uic.edu wrote:

 On Tue, Apr 23, 2013 at 03:24:16PM -0500, Wilhelmina Randtke wrote:
  I would like to compare versions of a website scraped at different times
 to
  see what paragraphs on a page have changed.  Does anyone here know of a
  tool for holding two files side by side and noting what is the same and
  what is different between the files?
 
  It seems like any simple script to note differences in two strings of
 text
  would work, but I don't know a tool to use.

 diff --side-by-side file1 file2

 ./fxk

 
  -Wilhelmina Randtke
 

 --
 I went to the race track once and bet on a horse that was so good that
 it took seven others to beat him!




-- 
Sent from my GMail account.


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Joe Hourcle
On Apr 23, 2013, at 4:37 PM, Alexander Duryee wrote:

 The absolute simplest way to do this would be to fire up a terminal
 (OSX/Linux) and:
 
 diff page1.html page2.html | less
 
 Unfortunately, this will also catch changes made in other markup, and
 may or may not be terribly readable.

At the very least, I'd suggest adding a '-b' which will ignore changes to 
whitespace.

Also see:

http://www.w3.org/wiki/HtmlDiff

-Joe


 On Tue, Apr 23, 2013 at 4:31 PM, Alevtina Verbovetskaya
 alevtina.verbovetsk...@mail.cuny.edu wrote:
 I've recently begun to use Beyond Compare: http://www.scootersoftware.com/ 
 It's not free or OSS, though.
 
 There's also a plugin for Notepad++ that does something similar: 
 http://sourceforge.net/projects/npp-compare/ This is free, of course.
 
 Thanks!
 Allie
 
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of 
 Wilhelmina Randtke
 Sent: Tuesday, April 23, 2013 4:24 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: [CODE4LIB] Tool to highlight differences in two files
 
 I would like to compare versions of a website scraped at different times to 
 see what paragraphs on a page have changed.  Does anyone here know of a tool 
 for holding two files side by side and noting what is the same and what is 
 different between the files?
 
 It seems like any simple script to note differences in two strings of text 
 would work, but I don't know a tool to use.
 
 -Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Shaun Ellis
1. Create a gist (https://gist.github.com/{yourUserName}) with the 
original document.

2. Click edit and paste in the new document.
3. Click Revisions link.

You'll see all the changes.

On 4/23/13 4:24 PM, Wilhelmina Randtke wrote:

I would like to compare versions of a website scraped at different times to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Kyle Banerjee
Same trick will work with google docs. Or just use the compare two versions
of a document feature in Word...

On Tue, Apr 23, 2013 at 2:14 PM, Shaun Ellis sha...@princeton.edu wrote:

 1. Create a gist 
 (https://gist.github.com/{**yourUserName}https://gist.github.com/%7ByourUserName%7D)
 with the original document.
 2. Click edit and paste in the new document.
 3. Click Revisions link.

 You'll see all the changes.




Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Pottinger, Hardy J.
Hi, I'm not sure if you're really looking for a diff tool, so I'll just
shout an answer to a question that I think you might be asking. I use a
variation of the script posted here:

http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes


for watching a web page for changes. I mostly only ever use this for
watching for new artifacts to appear in Maven Central (because refreshing
a web page is pretty dull work).

Hope this helps.

--
HARDY POTTINGER pottinge...@umsystem.edu
University of Missouri Library Systems
http://lso.umsystem.edu/~pottingerhj/
https://MOspace.umsystem.edu/
Do you love it? Do you hate it? There it is, the way you made it.
--Frank Zappa





On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:

I would like to compare versions of a website scraped at different times
to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Wilhelmina Randtke
That helps a lot, because it's for websites which is what I want to compare.

I am looking for changes in a site, and I have some archives, but tools for
merging code are too labor intensive and don't give a good visual report
that I can show to a supervisor.  This is good moving forward, but doesn't
cover historical pages.

I was hoping for something where I could call up two pages and get a visual
display of differences for the display version of html, not the code.

-Wilhelmina

On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I'm not sure if you're really looking for a diff tool, so I'll just
 shout an answer to a question that I think you might be asking. I use a
 variation of the script posted here:

 http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes


 for watching a web page for changes. I mostly only ever use this for
 watching for new artifacts to appear in Maven Central (because refreshing
 a web page is pretty dull work).

 Hope this helps.

 --
 HARDY POTTINGER pottinge...@umsystem.edu
 University of Missouri Library Systems
 http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Do you love it? Do you hate it? There it is, the way you made it.
 --Frank Zappa





 On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:

 I would like to compare versions of a website scraped at different times
 to
 see what paragraphs on a page have changed.  Does anyone here know of a
 tool for holding two files side by side and noting what is the same and
 what is different between the files?
 
 It seems like any simple script to note differences in two strings of text
 would work, but I don't know a tool to use.
 
 -Wilhelmina Randtke



Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread John Lolis
I'm a real newbie when it comes to such things, but what is the
possibility of using AJAX to cycle through each version of a page?  I'm
guessing that dynamic elements in each of those pages might pose a
problem (that I'll leave up to you coding experts to answer).  If it's
visuals you're really looking to get, it certainly would be a cool
effect to have the pages fade in and out.  And a timeline slider would
be nice.
 
 
John Lolis
Information Technology Manager
White Plains Public Library
100 Martine Avenue
White Plains, NY 10601

email: jlo...@wppl.lib.ny.us 
tel: 1.914.422.1497
fax: 1.914.422.1452

http://whiteplainslibrary.org/
 
*¿*
  -


 On 4/23/2013 at 6:18 PM, in message
CAJGGFoo0FOwAUnXSAP=pYOD0o6mOXWWLG7p=fmcfkd6mbqx...@mail.gmail.com,
Wilhelmina Randtke rand...@gmail.com wrote:
That helps a lot, because it's for websites which is what I want to
compare.

I am looking for changes in a site, and I have some archives, but tools
for
merging code are too labor intensive and don't give a good visual
report
that I can show to a supervisor.  This is good moving forward, but
doesn't
cover historical pages.

I was hoping for something where I could call up two pages and get a
visual
display of differences for the display version of html, not the code.

-Wilhelmina

On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I'm not sure if you're really looking for a diff tool, so I'll
just
 shout an answer to a question that I think you might be asking. I use
a
 variation of the script posted here:


http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes



 for watching a web page for changes. I mostly only ever use this for
 watching for new artifacts to appear in Maven Central (because
refreshing
 a web page is pretty dull work).

 Hope this helps.

 --
 HARDY POTTINGER pottinge...@umsystem.edu
 University of Missouri Library Systems
 http://lso.umsystem.edu/~pottingerhj/ 
 https://MOspace.umsystem.edu/ 
 Do you love it? Do you hate it? There it is, the way you made it.
 --Frank Zappa





 On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:

 I would like to compare versions of a website scraped at different
times
 to
 see what paragraphs on a page have changed.  Does anyone here know
of a
 tool for holding two files side by side and noting what is the same
and
 what is different between the files?
 
 It seems like any simple script to note differences in two strings
of text
 would work, but I don't know a tool to use.
 
 -Wilhelmina Randtke



Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread stuart yeates
Automating your favourite browser to load and screenshot each version 
and then using http://www.imagemagick.org/Usage/compare/ should work.


Note that this will also catch the scenario where someone has changed 
the page by changing an image on the page.


cheers
stuart

On 24/04/13 10:18, Wilhelmina Randtke wrote:

That helps a lot, because it's for websites which is what I want to compare.

I am looking for changes in a site, and I have some archives, but tools for
merging code are too labor intensive and don't give a good visual report
that I can show to a supervisor.  This is good moving forward, but doesn't
cover historical pages.

I was hoping for something where I could call up two pages and get a visual
display of differences for the display version of html, not the code.

-Wilhelmina

On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:


Hi, I'm not sure if you're really looking for a diff tool, so I'll just
shout an answer to a question that I think you might be asking. I use a
variation of the script posted here:

http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes


for watching a web page for changes. I mostly only ever use this for
watching for new artifacts to appear in Maven Central (because refreshing
a web page is pretty dull work).

Hope this helps.

--
HARDY POTTINGER pottinge...@umsystem.edu
University of Missouri Library Systems
http://lso.umsystem.edu/~pottingerhj/
https://MOspace.umsystem.edu/
Do you love it? Do you hate it? There it is, the way you made it.
--Frank Zappa





On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:


I would like to compare versions of a website scraped at different times
to
see what paragraphs on a page have changed.  Does anyone here know of a
tool for holding two files side by side and noting what is the same and
what is different between the files?

It seems like any simple script to note differences in two strings of text
would work, but I don't know a tool to use.

-Wilhelmina Randtke







--
Stuart Yeates
Library Technology Services http://www.victoria.ac.nz/library/


Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Genny Engel
There's a list here that may be more along the lines of what you're seeking.

http://webapps.stackexchange.com/questions/11547/diff-for-websites


Genny Engel
Sonoma County Library
gen...@sonoma.lib.ca.us
707 545-0831 x1581
www.sonomalibrary.org

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of 
Wilhelmina Randtke
Sent: Tuesday, April 23, 2013 3:19 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Tool to highlight differences in two files

That helps a lot, because it's for websites which is what I want to compare.

I am looking for changes in a site, and I have some archives, but tools for
merging code are too labor intensive and don't give a good visual report
that I can show to a supervisor.  This is good moving forward, but doesn't
cover historical pages.

I was hoping for something where I could call up two pages and get a visual
display of differences for the display version of html, not the code.

-Wilhelmina

On Tue, Apr 23, 2013 at 5:14 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I'm not sure if you're really looking for a diff tool, so I'll just
 shout an answer to a question that I think you might be asking. I use a
 variation of the script posted here:

 http://stackoverflow.com/questions/1494488/watch-a-web-page-for-changes


 for watching a web page for changes. I mostly only ever use this for
 watching for new artifacts to appear in Maven Central (because refreshing
 a web page is pretty dull work).

 Hope this helps.

 --
 HARDY POTTINGER pottinge...@umsystem.edu
 University of Missouri Library Systems
 http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Do you love it? Do you hate it? There it is, the way you made it.
 --Frank Zappa





 On 4/23/13 3:24 PM, Wilhelmina Randtke rand...@gmail.com wrote:

 I would like to compare versions of a website scraped at different times
 to
 see what paragraphs on a page have changed.  Does anyone here know of a
 tool for holding two files side by side and noting what is the same and
 what is different between the files?
 
 It seems like any simple script to note differences in two strings of text
 would work, but I don't know a tool to use.
 
 -Wilhelmina Randtke



Re: [CODE4LIB] Tool to highlight differences in two files

2013-04-23 Thread Joe Hourcle
On Apr 23, 2013, at 8:12 PM, Genny Engel wrote:

 There's a list here that may be more along the lines of what you're seeking.
 
 http://webapps.stackexchange.com/questions/11547/diff-for-websites


Hmm ... I guess I should actually accept the answer as it was the only one ever 
given.

-Joe