Re: [CODE4LIB] Open Source OPAC - VUFind Beta Released
On 7/20/07, Andrew Nagy [EMAIL PROTECTED] wrote: http://www.vufind.org/ Excellent stuff, and thanks for the open-source effort. Three things ; 1. Will there be efforts towards a development community outside your library? 2. http://www.vufind.org/demo/Record/56179 has serious problems in its similar items section. :) 3. If you scroll down a list of things and then do something that requires a login, only the top part of the page that's not in view has the action. The user sees nothing, and nothing happens. Apart from that, great stuff and, if you accept such, I'd love to participate in ways that I can. Kind regards, Alexander -- --- Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps -- http://shelter.nu/blog/
Re: [CODE4LIB] Citation parsing?
On Jul 18, 2007, at 10:04 PM, Eric Hellman wrote: Also, even in (many) scholarly journals, editorial consistency is almost unbelievably poor -- lots of times, the rules just aren't followed. Punctuation gets missed, journal names (especially abbreviations!) are misspelled... and so on. Rule-based and heuristic systems are always going to have problems in those cases. Heuristics are perhaps the only way to deal with lack of consistent format. (i.e. a cluster of words including journal of is likely to contain a journal name) If you have a halfway decent journal name parser (such as the one in our openurl software) it already contains a large list of journal misspellings. In a lot of ways, I think the problem is fundamentally similar to identifying parts of speech in natural language (which has lots of the same ambiguities) -- and the same techniques that succeed there will probably yield the most robust results for citation parsing. Have people been able to do a decent job of identifying parts of speech in natural language? -- Eric Hellman, DirectorOCLC Openly Informatics Division [EMAIL PROTECTED] [EMAIL PROTECTED] 2 Broad St., Suite 208 tel 1-973-509-7800 fax 1-734-468-6216 Bloomfield, NJ 07003 http://openly.oclc.org/1cate/ 1 Click Access To Everything
[CODE4LIB] Cannot use windows search text inside .java .jsp or .bas files?
Is this a well known feature or something I've managed to bring on myself through an excess of customization? Try this: In the windows search tool specify All or part of file name: .java A word or phrase in the file: import Look in: some directory with java files I've tried this on three different work stations and the result has always been: Search Complete: No results to display Same thing happens searching for common statements inside .jsp and .bas files. PS: I also have search system files enabled, so they are not being skipped for that reason begin:vcard fn:Jeffrey Barnett n:Barnett;Jeffrey org:Yale University Library;Integrated Library Technical Services adr;dom:;;;New Haven;CT;06520-8240 email;internet:[EMAIL PROTECTED] title:Sr. Research Analyst tel;work:(203) 432-1752 x-mozilla-html:FALSE version:2.1 end:vcard
Re: [CODE4LIB] Cannot use windows search text inside .java .jsp or .bas files?
Yes, I know I real programmers use grep ;-) But I still want an explanation! Jeffrey Barnett wrote: Is this a well known feature or something I've managed to bring on myself through an excess of customization? Try this: In the windows search tool specify All or part of file name: .java A word or phrase in the file: import Look in: some directory with java files I've tried this on three different work stations and the result has always been: Search Complete: No results to display Same thing happens searching for common statements inside .jsp and .bas files. PS: I also have search system files enabled, so they are not being skipped for that reason begin:vcard fn:Jeffrey Barnett n:Barnett;Jeffrey org:Yale University Library;Integrated Library Technical Services adr;dom:;;;New Haven;CT;06520-8240 email;internet:[EMAIL PROTECTED] title:Sr. Research Analyst tel;work:(203) 432-1752 x-mozilla-html:FALSE version:2.1 end:vcard
Re: [CODE4LIB] Cannot use windows search text inside .java .jsp or .bas files?
A better link than the one I just sent, still not great http://support.microsoft.com/kb/309173. Original message Date: Fri, 20 Jul 2007 16:18:12 -0400 From: Joe Atzberger [EMAIL PROTECTED] Subject: Re: [CODE4LIB] Cannot use windows search text inside .java .jsp or .bas files? To: CODE4LIB@listserv.nd.edu I can corroborate your experience here. Search for filename *.java and get hits. View one of those .java files, copy a string out of it, and go back to the search. Search for filename *.java again, with contents matching the string you paste in. Get zero hits. Lame! Google Desktop search does the trick for me, however. Try that instead. -- joe atzberger On 7/20/07, Jeffrey Barnett [EMAIL PROTECTED] wrote: Yes, I know I real programmers use grep ;-) But I still want an explanation! Jeffrey Barnett wrote: Is this a well known feature or something I've managed to bring on myself through an excess of customization? Try this: In the windows search tool specify All or part of file name: .java A word or phrase in the file: import Look in: some directory with java files I've tried this on three different work stations and the result has always been: Search Complete: No results to display Same thing happens searching for common statements inside .jsp and .bas files. PS: I also have search system files enabled, so they are not being skipped for that reason
[CODE4LIB] Systems Administrator for Federal Library (Washington DC)
LAC is looking for Systems Administrator to work on contract at a federal library in the Washington DC area. The Administrator will provide all aspects of computer systems administration in a heterogeneous computing environment. Operating systems include Sun Solaris 9/10 and Windows (Windows 2003 Server and XP Professional). Responsibilities: * Provide operational support of the UNIX (SUN SOLARIS) and Microsoft Windows (including Active Directory)computing environment including hardware installations, upgrades, OS and application software installations and upgrades, security monitoring and patches, backup and recovery administration, maintaining users and peripheral equipment, troubleshooting system and application issues. * Provide LAN troubleshooting and support. * Provide technical recommendations for information systems operations in the UNIX (SUN SOLARIS) environment. Implement modifications to increase system capacity. Analyze the results of performance and systems management software and recommend changes to improve processing and utilization. * Perform all procedures necessary to ensure the safety of information systems assets and to protect or recover systems from intentional or inadvertent access or destruction. Maintain and update security information, and certification and accreditation packages to meet the agencies policies. * Function as an Alternate Computer Security Officer. * Provide PC technical support as needed (SW HW). * Participate in evaluation, selection, and installation of computer hardware and software packages; evaluate and make recommendations on state-of-the-art technology improvements * Maintain and develop system documentation, policies, and procedures that promote optimal use of computer resources Required Qualifications: * BS in Computer Science, Information Systems, or relevant technical discipline (equivalent experience may be considered) * 3+ years experience in systems administration. * Extensive knowledge of UNIX (SUN SOLARIS 9/10) operating systems. * Extensive knowledge in Microsoft Windows 2003 Server and Active Directory. * Extensive knowledge of backup, web server, and document software (e.g., Veritas Netbackup, Apache, Tomcat, Microsoft Office suite). * Excellent verbal and written communication skills. * Ability to manage diverse tasks and professionally interact with users and vendors and to work effectively and cooperatively on a team project. * Must be able to work varied schedules to accommodate system maintenance or upgrades. * Lifting and relocation of computer equipment (must be able to lift up to 50 lbs.). * Must be a U.S. Citizen. Desired Qualifications: * Systems Administration and Network/Security certifications desired. * Networking skills desired. * Knowledge of shell scripting, MySQL, PHP, Perl, SharePoint, proxy servers, and web analysis applications are desired * Knowledge of Macintosh OS X. * Knowledge of Library software such as SirsiDynix Unicorn, ILLiad, etc. Compensation / To Apply * Competitive compensation; * Please email resume and cover letter to Marcy Johnson, [EMAIL PROTECTED] Please include the job title in your resume and cover letter. EOE The information contained in this e-mail message is privileged, confidential, and protected from disclosure. If you are not the intended recipient, any dissemination or copying is strictly prohibited. If you think that you have received this e-mail message in error, please e-mail the sender.
Re: [CODE4LIB] Citation parsing?
On 7/20/07, Eric Hellman [EMAIL PROTECTED] wrote: Have people been able to do a decent job of identifying parts of speech in natural language? I think trying to import broad NLP findings into our narrower problem of citation parsing is not likely to be fruitful but on the other hand stealing their tools seems perfectly reasonable, and this group seems to be familiar with several. About 8 years ago, I made use of a parser-genator called ANTLR (ANother Tool For Language Recognition) that takes an EBNF grammar spec and builds a parser. Since then developers have improved the tool with some new versions and even a GUI development environment. The languages recognized in practice all seem to be well-defined programming languages, but if you wanted to roll your own (new) parser for citations, ANTLR might help. I think ANTLR satisfies Eric's first two crtiteria for flexibility and ease of extension and might be used to satisfy the third (broad contextual info). It now includes a kind of ability to back itself out of rule descent and try other alternatives in the tree if the static gramar fails. The license is BSD. Notably, it supports unicode and the new version does NOT require a pre-specified number of look-ahead tokens. And the userbase is fairly broad for such a specialized tool. This might be considered an incongruous solution inasmuch as you are asking for parser characteristics and I am recommending a parser generator that *could* produce the kind of parser you want. But I think that is appropriate for the task described. --joe