[CODE4LIB] software package for Elec. Theses/dissertations

2009-01-07 Thread Han, Yan
Hello, Colleagues, As ProQuest/UMI switched its delivery platform for Electronic Theses and Dissertations (ETD), I have developed a small software package to process ETDs. The software package does the following: 1. Unzips ProQuest/UMI ETD delivery zip files and creates one directory per ETD. 2.
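The first step described above (one directory per delivered ETD zip file) could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the package announced in this thread; the function name `unpack_etd_archives` and the choice to name each directory after its zip file are assumptions made for the example.

```python
import zipfile
from pathlib import Path


def unpack_etd_archives(delivery_dir: Path, output_dir: Path) -> list[Path]:
    """Extract every .zip in delivery_dir into its own subdirectory of output_dir.

    Hypothetical helper: the directory name is taken from the zip file name,
    which is an assumption, not something the original package is known to do.
    """
    created = []
    for archive in sorted(delivery_dir.glob("*.zip")):
        target = output_dir / archive.stem  # one directory per ETD
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        created.append(target)
    return created
```

Calling `unpack_etd_archives(Path("delivery"), Path("etds"))` would return the list of directories it created, which a later processing step could iterate over.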

[CODE4LIB] ETD package for ProQuest/UMI old and new delivery platforms

2009-01-21 Thread Han, Yan
Hello, All, As mentioned before, I have received quite a few inquiries about the packages. I have created a web page so that you can download them. I have also made some fixes to the package. The software package does: * Unzip ProQuest/UMI ETD delivery zipped files, and create one

Re: [CODE4LIB] MARC 21 and MODS

2009-01-29 Thread Han, Yan
I clicked 2 URLs, and they are broken. What happened? 404 Not Found There is no SKOS Concept, ConceptScheme, or Collection instance in the registry available using this resource URI. -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Tim Cornwell

Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and determine if each one will open? (eom)

2009-01-29 Thread Han, Yan
Try PDFBox. It can index PDF documents. -Original Message- From: Code for Libraries on behalf of Thomas Dowling Sent: Wed 1/28/2009 2:37 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and determine if each one will open? (eom)
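For the question in the subject line, a rough first pass can be made without any PDF library. The sketch below does not use PDFBox and does not prove that a file will open; it only flags files in a folder that lack the `%PDF-` header or the `%%EOF` trailer marker, which usually indicates truncation or a wrong file type. The function names are hypothetical, and files that pass should still be checked with a real PDF parser.

```python
from pathlib import Path


def looks_like_pdf(path: Path) -> bool:
    """Crude structural check: PDF header near the start, %%EOF near the end."""
    data = path.read_bytes()
    return b"%PDF-" in data[:1024] and b"%%EOF" in data[-1024:]


def find_suspect_pdfs(folder: Path) -> list[Path]:
    """Return the .pdf files in folder that fail the crude check above."""
    return [p for p in sorted(folder.glob("*.pdf")) if not looks_like_pdf(p)]
```

Running `find_suspect_pdfs(Path("scans"))` gives a short list to inspect by hand, which is usually faster than opening every file.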

[CODE4LIB] Linux tools for making PDFs

2009-02-03 Thread Han, Yan
Hello, Do you know of a tool running under Linux to make PDFs from images? I use Adobe Acrobat Professional in Windows to create PDFs from image files. However, Acrobat does not handle image files with East Asian characters. Yan

[CODE4LIB] OCR engine for Persian/Dari

2009-02-03 Thread Han, Yan
Hello, Do you know of an OCR engine for Persian/Dari? If so, what is its accuracy rate? Thanks, Yan

Re: [CODE4LIB] OCR engine for Persian/Dari

2009-02-04 Thread Han, Yan
Mark, Many thanks for your input. This is one of the packages that I am thinking of. Good to know its accuracy. Yan -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Mark Jordan Sent: Tuesday, February 03, 2009 5:36 PM To:

Re: [CODE4LIB] Something completely different

2009-04-06 Thread Han, Yan
Well, the future of the ILS is to use general computing standards rather than libraries making their own. Essentially, from a computing theory view, a graph is the way to represent all the info (i.e. a graph can represent a tree, or a line. When you look at MARC, it is a linear computing model.) Graph is

Re: [CODE4LIB] You got it!!!!! Re: [CODE4LIB] Something completely different

2009-04-10 Thread Han, Yan
Bill and Peter, Very nice posts. XML, RDF, MARC and DC are all different ways to present information (of course, XML, RDF, and DC are easier for machines to read and process). However, down at the fundamentals, I think that it can go deeper, basically data structure and algorithms making

Re: [CODE4LIB] Recommend book scanner?

2009-05-01 Thread Han, Yan
That is right. In addition, for certain printing (gold seal), a digital camera delivers better results than a scanner. -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Jonathan Rochkind Sent: Friday, May 01, 2009 2:38 PM To:

Re: [CODE4LIB] Recommend book scanner?

2009-05-04 Thread Han, Yan
The National Archives has guidelines that describe targets you can use for scanning comparison. There are other targets used in other books/articles. I suggest that you check the National Archives' guidelines. http://www.archives.gov/preservation/technical/guidelines.html -Original

Re: [CODE4LIB] Digital imaging questions

2009-06-18 Thread Han, Yan
There are at least two things about archival images that I can think of at this moment: 1. The resolution: different sizes/materials require different resolutions. There is no one-size-fits-all. To make a judgment, I would need to know the image type (color?) and the size of the material. 2. The file format: TIFF is

Re: [CODE4LIB] Assigning DOI for local content

2009-11-18 Thread Han, Yan
Currently DOI uses Handle (technology) with its social framework (i.e. an administrative body to manage DOI). In a technical sense, PURL is not going to last long. Crossref handles DOI registration in the U.S. In Europe and Asia, other organizations handle it. DOI is also currently going

Re: [CODE4LIB] Assigning DOI for local content

2009-11-19 Thread Han, Yan
for local content On Wed, Nov 18, 2009 at 12:19 PM, Han, Yan h...@u.library.arizona.edu wrote: Currently DOI uses Handle (technology) with its social framework (i.e. an administrative body to manage DOI). In a technical sense, PURL is not going to last long. I'm not entirely sure what

[CODE4LIB] OCR for handwritten pages

2010-01-13 Thread Han, Yan
Hello, Colleagues, Does anyone know of or use any OCR software that works on handwritten pages, or at least works well enough to be better than hiring a student to key the text in? I know of OCR software such as ABBYY, but it does not work on handwriting. Thanks, Yan

[CODE4LIB] Amazon EC2 ports: only 80 and 8080?

2010-07-06 Thread Han, Yan
Hello, Currently we would like to have an Amazon EC2 node hosting 2 applications: DSpace and Koha (so we need 4 ports). However, it seems to me that only ports 80 and 8080 are available. Any other ports are not accessible from outside. Does anyone have similar experience and know how to open other

Re: [CODE4LIB] DL Systems (allowing search within documents and access restrictions)?

2010-10-20 Thread Han, Yan
I would think DSpace, Fedora, and EPrints. DSpace is fairly easy to implement and has embargo support in 1.6 (https://wiki.duraspace.org/display/DSTEST/Embargo). I have an article comparing DSpace and Fedora, but it was written 6 years ago. DSpace has not changed much, but Fedora is a

Re: [CODE4LIB] DL Systems (allowing search within documents and access restrictions)?

2010-10-20 Thread Han, Yan
DSpace does full-text search; you need to turn it on in the configuration file. See UAL http://arizona.openrepository.com/arizona/ Yan -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Deng, Sai Sent: Wednesday, October 20, 2010 2:14 PM To:

Re: [CODE4LIB] LAMP Hosting service that supports php_yaz?

2011-03-07 Thread Han, Yan
You can just buy a node from a variety of cloud providers such as Amazon EC2, Linode etc. (It is very easy to build anything you want). Yan -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Cindy Harper Sent: Sunday, March 06, 2011 10:54 AM

Re: [CODE4LIB] LAMP Hosting service that supports php_yaz?

2011-03-07 Thread Han, Yan
for you). Yan Han, Associate Librarian The University of Arizona Libraries Phone: (520)307-2823 Email: h...@u.library.arizona.edu From: Cindy Harper [mailto:char...@colgate.edu] Sent: Monday, March 07, 2011 11:18 AM To: Code for Libraries Cc: Han, Yan Subject: Re: [CODE4LIB] LAMP Hosting service

[CODE4LIB] III loading module cannot handle non-English characters

2013-01-22 Thread Han, Yan
Hello, We have problems using the III loading module to load MARC files (.mrc) to our catalog. This is to use Data Exchange Load Electronic Records (itm). Basically, non-English characters (French, Arabic) are changed to unknown symbols. The MARC files (.mrk and .mrc) are verified before

[CODE4LIB] : Persian Romanization table

2013-01-22 Thread Han, Yan
Hello, All, I have a project to deal with Persian materials. I have already used the Google Translate API to translate. Now I am looking for an API to transliterate/Romanize (NOT translate) Persian to English (not English to Persian). In other words, I have Persian in, and English out. There is a

Re: [CODE4LIB] : Persian Romanization table

2013-04-17 Thread Han, Yan
Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Han, Yan Sent: Tuesday, January 22, 2013 5:31 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] : Persian Romanization table Hello, All, I have a project to deal with Persian materials. I have already used the Google

Re: [CODE4LIB] : Persian Romanization table

2013-04-19 Thread Han, Yan
, but carries a GPLv2 license--maybe useful in some testing, and see if it's worth expanding on the effort. Best, Charles Riley From: Han, Yan [h...@u.library.arizona.edu] Sent: Wednesday, April 17, 2013 8:14 PM To: Jacobs, Jane W; Code for Libraries

Re: [CODE4LIB] Amazon Glacier - tracking deposits

2015-04-09 Thread Han, Yan - (yhan)
Be aware of the data transfer cost if you are using Glacier. Glacier is an excellent choice for archival use, but you want to be sure these files will not be accessed often. You should consider the total cost of ownership including data transfer cost, which could be very expensive if you retrieve more

Re: [CODE4LIB] Strategy for assigning DOIs?

2016-02-09 Thread Han, Yan - (yhan)
Hi, Jason, I strongly suggest keeping your DOI namespace/naming scheme totally independent of your choice of repository/system. DOI is an infrastructure thing, and the main reason for assigning DOIs is persistence and permanence. At some point any repository system will go

[CODE4LIB]

2016-02-08 Thread Han, Yan - (yhan)
Yes. Use iText or PDFBox. These are common PDF libraries. On 2/6/16, 2:24 PM, "Code for Libraries on behalf of Andrew Cunningham" wrote: >Hi all, > >I am working with PDF files in some South Asian and South East Asian

[CODE4LIB] EMPLOYMENT OPPORTUNITY: Department Head, Office of Digital Innovation and Stewardship (ODIS)

2016-01-26 Thread Han, Yan - (yhan)
Please share the posting with interested parties. Tucson has mild winters and dry, warm summers. The person will be working with engaged and nice colleagues. EMPLOYMENT OPPORTUNITY Department Head, Office of Digital Innovation and Stewardship The University of Arizona Libraries, Digital