Re: [CODE4LIB] Strategy for assigning DOIs?

2016-02-09 Thread Han, Yan - (yhan)
Hi, Jason, I strongly suggest to separate your DOI namespace/naming schema to be totally independent of your choice of repository/system. DOI is an infrastructure thing, and the main reason behind of assigning DOI is for persistency and permanency. At some point any repository system will go aw

[CODE4LIB]

2016-02-08 Thread Han, Yan - (yhan)
Yes. Use iText or PDFBox These are common PDF libraries. On 2/6/16, 2:24 PM, "Code for Libraries on behalf of Andrew Cunningham" wrote: >Hi all, > >I am working with PDF files in some South Asian and South East Asian >languages. Each PDF has ActualText added for each tag in the PDF. Each P

[CODE4LIB] EMPLOYMENT OPPORTUNITY: Department Head, Office of Digital Innovation and Stewardship (ODIS)

2016-01-26 Thread Han, Yan - (yhan)
Please share the posting with interested parties. Tucson has mild winter and dry / warm summer. The person will be working with engaged and nice colleagues. EMPLOYMENT OPPORTUNITY Department Head, Office of Digital Innovation and Stewardship The University of Arizona Libraries, Digital Innov

[CODE4LIB] Employment OPPO: Librarian/Specialist, Metadata Services

2016-01-19 Thread Han, Yan - (yhan)
Hi, Please share the posting. The libraries is located in Tucson, AZ, a metro city with a small town feel. We have very nice weather in winter and not bad in summer. The person will be working with engaged and nice colleagues. Yan Position Title: Librarian/Specialist, Metadata Services Departm

Re: [CODE4LIB] Amazon Glacier - tracking deposits

2015-04-09 Thread Han, Yan - (yhan)
Be aware of data transfer cost if you are using Glacier. Glacier is excellent choice for archive use, but you want to be sure these files shall not be accessed often. You shall consider the total cost of ownership including data transfer cost, which could be very expensive if you retrieve more tha

Re: [CODE4LIB] : Persian Romanization table

2013-04-19 Thread Han, Yan
taset, but carries a GPLv2 license--maybe useful in some testing, and see if it's worth expanding on the effort. Best, Charles Riley ____ From: Han, Yan [h...@u.library.arizona.edu] Sent: Wednesday, April 17, 2013 8:14 PM To: Jacobs, Jane W; Code for

Re: [CODE4LIB] : Persian Romanization table

2013-04-17 Thread Han, Yan
23, 2013 6:28 AM To: Han, Yan Subject: RE: : Persian Romanization table Hi Yan, As per my message to the listserve, here are the config files for Urdu. If you do a Persian config file, I d love to get it and if possible add it to the MARC::Detrans site. Let me know if you want to

[CODE4LIB] : Persian Romanization table

2013-01-22 Thread Han, Yan
Hello, All, I have a project to deal with Persian materials. I have already uses Google Translate API to translate. Now I am looking for an API to transliterate /Romanize (NOT Translate) Persian to English (not English to Persian). In other words, I have Persian in, and English out. There is a R

[CODE4LIB] III loading module cannot handle non-English characters

2013-01-22 Thread Han, Yan
Hello, We have problems using III loading module to load MARC files (.mrc) to our catalog. This is to use "Data Exchange" > "Load Electronic Records (itm)". Basically non- English characters (French, Arabic ) will be changed to unknown symbols. The MARC files (.mrk and .mrc) are verified befo

Re: [CODE4LIB] Help with Chinese exchange librarian

2012-07-12 Thread Han, Yan
Hello, Paul, I am sure that I can help out. You or he can drop me an email h...@u.library.arizona.edu Yan Han The University of Arizona Libraries -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Paul Orkiszewski Sent: Thursday, July 12, 2012 8

[CODE4LIB] New Electronic Theses and Dissertations software for processing ProQuest /UMI ETD files

2011-10-27 Thread Han, Yan
Hello, All, The Electronic Theses and Dissertation software is to help library staff process ProQuest/UMI delivered ETD files. It has been updated with new GUI interface and updated code so that Windows users can use. It can also run under Linux. It is Java executable files called "etd.jar". Yo

Re: [CODE4LIB] LAMP Hosting service that supports php_yaz?

2011-03-07 Thread Han, Yan
for you). Yan Han, Associate Librarian The University of Arizona Libraries Phone: (520)307-2823 Email: h...@u.library.arizona.edu From: Cindy Harper [mailto:char...@colgate.edu] Sent: Monday, March 07, 2011 11:18 AM To: Code for Libraries Cc: Han, Yan Subject: Re: [CODE4LIB] LAMP Hosting service

Re: [CODE4LIB] LAMP Hosting service that supports php_yaz?

2011-03-07 Thread Han, Yan
You can just buy a node from a variety of cloud providers such as Amazon EC2, Linode etc. (It is very easy to build anything you want). Yan -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Cindy Harper Sent: Sunday, March 06, 2011 10:54 AM To

Re: [CODE4LIB] DL Systems (allowing search within documents and access restrictions)?

2010-10-20 Thread Han, Yan
DSpace does Full-text search, you need to turn on the configuration file. See UAL http://arizona.openrepository.com/arizona/ Yan -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Deng, Sai Sent: Wednesday, October 20, 2010 2:14 PM To: CODE4LIB@LI

Re: [CODE4LIB] DL Systems (allowing search within documents and access restrictions)?

2010-10-20 Thread Han, Yan
I would think DSpace, Fedora, and Eprint. DSpace is fairly easy to implement, which has embargo support in 1.6 (https://wiki.duraspace.org/display/DSTEST/Embargo ). I have an article comparing DSpace and Fedora, but was written 6 years ago. DSpace has not been changed much, but Fedora is a diffe

[CODE4LIB] Amazon EC2 ports: only 80 and 8080?

2010-07-06 Thread Han, Yan
Hello, Currently we would like to have Amazon EC2 node hosting 2 applications: DSpace and Koha (so that we need 4 ports). However, it seems to me that only port 80 and 8080 are available. Any other ports are not accessible from outside. Anyone has similar experience and knows how to open other po

[CODE4LIB] OCR for handwritten pages

2010-01-13 Thread Han, Yan
Hello, Colleagues, Does anyone know/use any OCR software working on handwritten pages? or at least think it is better than hiring a student key-in. I know these OCR software such as ABBYY, but they do not work on handwriting. Thanks, Yan

Re: [CODE4LIB] Assigning DOI for local content

2009-11-19 Thread Han, Yan
ss Singer Sent: Wednesday, November 18, 2009 8:11 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Assigning DOI for local content On Wed, Nov 18, 2009 at 12:19 PM, Han, Yan wrote: > Currently DOI uses Handle (technology) with it social framework (i.e. > administrative body to man

Re: [CODE4LIB] Assigning DOI for local content

2009-11-18 Thread Han, Yan
Currently DOI uses Handle (technology) with it social framework (i.e. administrative body to manage DOI). In technical sense, PURL is not going to last long. Crossref handles DOI registration in U.S. In Europe and Aisa, they have other organizations to handle it. DOI is also currently going thr

Re: [CODE4LIB] Digital imaging questions

2009-06-18 Thread Han, Yan
There are two things about archive images at least I can think of this moment: 1. the resolution: diff size/materials require different resolution. There is no one-size-fit-all. To make a judgment, I would like to know the image (color?), the size of the material?, 2. the file format: TIFF is th

Re: [CODE4LIB] Recommend book scanner?

2009-05-04 Thread Han, Yan
The National Archives has the guideline which describes target that you can use for scanning comparison. There are other targets used in other books/articles. I suggest that you check the National Archives' guidelines. http://www.archives.gov/preservation/technical/guidelines.html -Original

Re: [CODE4LIB] Recommend book scanner?

2009-05-01 Thread Han, Yan
That is right. In addition, for certain printing (gold seal), digital camera delivers better result than scanners. -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Jonathan Rochkind Sent: Friday, May 01, 2009 2:38 PM To: CODE4LIB@LISTSERV.ND.ED

Re: [CODE4LIB] You got it!!!!! Re: [CODE4LIB] Something completely different

2009-04-10 Thread Han, Yan
Bill and Peter, Very nice posts. XML, RDF, MARC and DC are all different ways to present information in a way (of course, XML, RDF, and DC are easier to read/processed by machine). However, down the fundamentals, I think that it can go deeper, basically data structure and algorithms making th

Re: [CODE4LIB] Something completely different

2009-04-06 Thread Han, Yan
Well, the future of ILS is to use general computing standards without making library's own. Essentially, from a computing theory view, a graph is the way to present all the info (i.e. a graph can represent a tree, or a line. When you look at MARC, it is a linear computing model.) Graph is powerfu

Re: [CODE4LIB] OCR engine for Persian/Dari

2009-02-04 Thread Han, Yan
Mark, Many thanks for your input. This is one of the packages that I am thinking of. Good to know its accuracy. Yan -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Mark Jordan Sent: Tuesday, February 03, 2009 5:36 PM To: CODE4LIB@LISTSERV.ND.

[CODE4LIB] OCR engine for Persian/Dari

2009-02-03 Thread Han, Yan
Hello, Do you know an OCR engine for Persian/Dari ? If so, what is the accurate rate? Thanks, Yan

[CODE4LIB] Linux tools for making PDFs

2009-02-03 Thread Han, Yan
Hello, Do you know a tool running under Linux to make PDFs from images? I use Adobe Acrobat professional in Windows to create PDFs from image files. However, Acrobat does not handle image files with east Asian characters. Yan

Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and determine if each one will open? (eom)

2009-01-29 Thread Han, Yan
try PDFBox. It can index PDF documents. -Original Message- From: Code for Libraries on behalf of Thomas Dowling Sent: Wed 1/28/2009 2:37 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and determine if each one will open? (eom)

Re: [CODE4LIB] MARC 21 and MODS

2009-01-29 Thread Han, Yan
I clicked 2 URLs, and they are broken. What happened? "404 Not Found There is no SKOS Concept, ConceptScheme, or Collection instance in the registry available using this resource URI." -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Tim Cornwel

[CODE4LIB] ETD package for ProQuest/UMI old and new delivery platforms

2009-01-21 Thread Han, Yan
Hello, All, As mentioned before, I have received quite a few inquiries about the packages. I have created a web page so that you can download them. I have also made some fixes on the package. The software package does: * Unzip ProQuest/UMI ETD delivery Zipped files, and create one dir

[CODE4LIB] software package for Elec. Theses/dissertations

2009-01-07 Thread Han, Yan
Hello, Colleagues, As ProQuest/UMI switched its delivery platform for Electronic Theses and dissertations(ETD), I have developed a small software package to process ETD. The software package does: 1. Unzip ProQuest/UMI ETD delivery Zipped files, and create one directory per ETD. 2.