Doh!

- Sean

> On Jun 28, 2016, at 12:56 PM, Michael B. Smith <[email protected]> wrote:
>
> As in beer? :)
>
> From: [email protected] [mailto:[email protected]]
> On Behalf Of Micheal Espinola Jr
> Sent: Tuesday, June 28, 2016 4:45 PM
> To: [email protected]
> Subject: Re: [NTSysADM] Compare two large lists
>
> With a dataset this large, I also recommend duff.
>
> --
> Espi
>
>
> On Tue, Jun 28, 2016 at 1:28 PM, Michael B. Smith <[email protected]> wrote:
>
> diff.
>
> If you want a GUI, WinDiff.
>
> -----Original Message-----
> From: [email protected] [mailto:[email protected]]
> On Behalf Of Kurt Buff
> Sent: Tuesday, June 28, 2016 3:16 PM
> To: ntsysadm
> Subject: Re: [NTSysADM] Compare two large lists
>
> Of the tools I'm aware of, BeyondCompare (http://scootersoftware.com/) is
> probably your best bet, but WinMerge (http://winmerge.org/) and some other
> tools might handle this, especially if the data is sorted.
>
> I'm sure others can make recommendations as well.
>
> If you've got time and a lot of RAM, PowerShell can do this as well - take
> each entry in the short list, compare it against the large list, and write
> it to a file if there's a match. That's a *very* slow algorithm, but it
> will work.
>
>
> On Tue, Jun 28, 2016 at 11:02 AM, Richard Stovall <[email protected]> wrote:
> > Not necessarily Windows-related.
> >
> > I need to compare a list of about 300,000 file hashes against a larger
> > list of ~30,000,000 and find ones that are represented in both data sets.
> >
> > I'm not a database guy, nor have I ever played one on TeeVee.
> >
> > Any ideas about how to go about this with standard/free tools in
> > Windows or Linux?
> >
> > TIA,
> > RS
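
For what it's worth, the "take each entry in the short list, compare against the large list" idea quoted above does not have to be slow. If the short list is loaded into a hash set, the large list can be streamed past it once, and each lookup takes roughly constant time. Below is a minimal Python sketch of that, not a tested production script. It assumes each file holds one hash per line and that hashes are hex strings, so case can be ignored; the function name `common_hashes` and the file names are made up for illustration.

```python
def common_hashes(small_path, large_path, out_path):
    """Write every hash present in both files to out_path; return the count.

    Assumption: one hash per line, compared case-insensitively.
    """
    # Load the short (~300,000-entry) list into a set for fast membership tests.
    with open(small_path, encoding="utf-8") as small:
        wanted = {line.strip().lower() for line in small if line.strip()}

    # Stream the large list line by line so it never has to fit in RAM.
    found = set()
    with open(large_path, encoding="utf-8") as large:
        for line in large:
            candidate = line.strip().lower()
            if candidate in wanted:
                found.add(candidate)

    # Write the matches in sorted order, one per line, without duplicates.
    with open(out_path, "w", encoding="utf-8") as out:
        for value in sorted(found):
            out.write(value + "\n")
    return len(found)


# Hypothetical usage:
# common_hashes("short_list.txt", "long_list.txt", "matches.txt")
```

Only the short list and the matches are held in memory, so the 30,000,000-line file is read once from disk and never loaded whole.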

