Re: [CODE4LIB] Comparing Barcodes Between 2 Files?

2017-11-22 Thread Renate Morgenstern
Thanks. I am using Linux. Will test it. Renate
On 23.11.2017 00:22, Kyle Banerjee wrote:
> Howdy Renate,
> I had sent it directly.
> cat file1 file2 |sort |uniq -d > duplicate_barcodes
> For Windows users for whom an emulation layer such as VirtualBox or Cygwin is overkill and who

Re: [CODE4LIB] Comparing Barcodes Between 2 Files?

2017-11-22 Thread Kyle Banerjee
Howdy Renate, I had sent it directly.
    cat file1 file2 |sort |uniq -d > duplicate_barcodes
For Windows users for whom an emulation layer such as VirtualBox or Cygwin is overkill and who encounter issues implementing GnuWin, there are two options: 1) Windows 10: Enable Linux subsystem (it's a Win
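For readers who cannot run the shell pipeline at all, the same comparison can be sketched in Python, which runs on Windows without an emulation layer. This is only an illustration, not what Kyle proposed: the function names are made up here, and it assumes one barcode per line in each file. It also differs from the one-liner in one respect: `uniq -d` reports any line that occurs more than once in the combined input, so a barcode repeated within a single file would be listed, whereas the set intersection below reports only barcodes present in both files.

```python
def read_barcodes(path):
    """Return the set of non-empty, whitespace-stripped lines in a file.

    Assumes one barcode per line (an assumption, not stated in the thread).
    """
    with open(path, encoding="utf-8") as handle:
        return {line.strip() for line in handle if line.strip()}


def common_barcodes(path_a, path_b):
    """Return the sorted list of barcodes that appear in both files."""
    return sorted(read_barcodes(path_a) & read_barcodes(path_b))


def write_barcodes(barcodes, out_path):
    """Write one barcode per line to out_path."""
    with open(out_path, "w", encoding="utf-8") as handle:
        for code in barcodes:
            handle.write(code + "\n")
```

With the file names from the one-liner, usage would look like `write_barcodes(common_barcodes("file1", "file2"), "duplicate_barcodes")`.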

Re: [CODE4LIB] Tokenizers for scientific corpora?

2017-11-22 Thread Thomas Krichel
Andromeda Yelton writes
> I'm doing a project to prototype machine-learning-driven interfaces to MIT's thesis collection, and my preprocessing step would really benefit from a tokenizer that is aware of common multi-word scientific tokens
In my machine-learning work with the RePEc digit

[CODE4LIB] Job: IT Librarian at the University of Denver

2017-11-22 Thread Jack Maness
Hello everyone (with apologies for cross-posting), We are looking for an Information Technologies Librarian to head our technology services at the University of Denver Libraries. We are a mid-sized, innovative

Re: [CODE4LIB] Comparing Barcodes Between 2 Files?

2017-11-22 Thread Renate Morgenstern
Good day, I could not see the solution suggested by Kyle Banerjee with the cat command. Possibly he sent it to you directly? If you still have it, please forward it to me. Thanks, Renate Morgenstern
> Thanks, everyone, for taking the time to reply to my question! I liked the simplicity of Kyle Ban

[CODE4LIB] Job: Web Developer/Designer at University at Albany, SUNY

2017-11-22 Thread Code4Lib Jobs
The University at Albany Libraries (State University of New York, Albany, NY) seek a service-oriented, creative, collaborative individual to serve as Web Developer/Designer. The successful candidate will have daily responsibility for organizing, maintaining, updating, and improving the Libraries

[CODE4LIB] Tokenizers for scientific corpora?

2017-11-22 Thread Andromeda Yelton
I'm doing a project to prototype machine-learning-driven interfaces to MIT's thesis collection, and my preprocessing step would really benefit from a tokenizer that is aware of common multi-word scientific tokens (e.g. "inertial mass" should definitely be one token, not two). My somewhat cursory r
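To make the request concrete, here is a minimal sketch of one simple way to keep multi-word terms together: greedy longest-match against a phrase list. It is an illustration only, not the approach taken in this thread. The function name and the phrase list are hypothetical, and a real project would need a curated or learned vocabulary of scientific terms. The sketch also ignores punctuation, so it would merge a phrase that straddles a sentence boundary.

```python
import re


def tokenize(text, phrases):
    """Lowercase and split text into word tokens, merging known phrases.

    `phrases` is a hypothetical, caller-supplied list of multi-word terms.
    At each position the longest matching phrase wins.
    """
    words = re.findall(r"[A-Za-z0-9]+(?:[-'][A-Za-z0-9]+)*", text.lower())
    phrase_set = {tuple(phrase.lower().split()) for phrase in phrases}
    max_len = max((len(phrase) for phrase in phrase_set), default=1)

    tokens = []
    i = 0
    while i < len(words):
        # Try the longest candidate first, down to two words.
        for n in range(min(max_len, len(words) - i), 1, -1):
            if tuple(words[i:i + n]) in phrase_set:
                tokens.append(" ".join(words[i:i + n]))
                i += n
                break
        else:
            # No multi-word phrase starts here; emit the single word.
            tokens.append(words[i])
            i += 1
    return tokens
```

For example, `tokenize("The inertial mass of the body", ["inertial mass"])` keeps "inertial mass" as a single token and splits the remaining words individually.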