Hi, Jason,
I strongly suggest to separate your DOI namespace/naming schema to be totally
independent of your choice of repository/system. DOI is an infrastructure
thing, and the main reason behind of assigning DOI is for persistency and
permanency. At some point any repository system will go aw
Yes. Use iText or PDFBox
These are common PDF libraries.
On 2/6/16, 2:24 PM, "Code for Libraries on behalf of Andrew Cunningham"
wrote:
>Hi all,
>
>I am working with PDF files in some South Asian and South East Asian
>languages. Each PDF has ActualText added for each tag in the PDF. Each P
Please share the posting with interested parties. Tucson has mild winter and
dry / warm summer. The person will be working with engaged and nice colleagues.
EMPLOYMENT OPPORTUNITY
Department Head, Office of Digital Innovation and Stewardship
The University of Arizona Libraries, Digital Innov
Hi,
Please share the posting. The libraries is located in Tucson, AZ, a metro city
with a small town feel. We have very nice weather in winter and not bad in
summer. The person will be working with engaged and nice colleagues.
Yan
Position Title: Librarian/Specialist, Metadata Services
Departm
Be aware of data transfer cost if you are using Glacier.
Glacier is excellent choice for archive use, but you want to be sure these
files shall not be accessed often.
You shall consider the total cost of ownership including data transfer
cost, which could be very expensive if you retrieve more tha
taset, but carries a GPLv2
license--maybe useful in some testing, and see if it's worth expanding on the
effort.
Best,
Charles Riley
____
From: Han, Yan [h...@u.library.arizona.edu]
Sent: Wednesday, April 17, 2013 8:14 PM
To: Jacobs, Jane W; Code for
23, 2013 6:28 AM
To: Han, Yan
Subject: RE: : Persian Romanization table
Hi Yan,
As per my message to the listserve, here are the config files for Urdu. If you
do a Persian config file, I d love to get it and if possible add it to the
MARC::Detrans site.
Let me know if you want to
Hello, All,
I have a project to deal with Persian materials. I have already uses Google
Translate API to translate. Now I am looking for an API to transliterate
/Romanize (NOT Translate) Persian to English (not English to Persian). In other
words, I have Persian in, and English out.
There is a R
Hello,
We have problems using III loading module to load MARC files (.mrc) to our
catalog. This is to use "Data Exchange" > "Load Electronic Records (itm)".
Basically non- English characters (French, Arabic ) will be changed to unknown
symbols. The MARC files (.mrk and .mrc) are verified befo
Hello, Paul,
I am sure that I can help out. You or he can drop me an email
h...@u.library.arizona.edu
Yan Han
The University of Arizona Libraries
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Paul
Orkiszewski
Sent: Thursday, July 12, 2012 8
Hello, All,
The Electronic Theses and Dissertation software is to help library staff
process ProQuest/UMI delivered ETD files.
It has been updated with new GUI interface and updated code so that Windows
users can use. It can also run under Linux. It is Java executable files called
"etd.jar". Yo
for you).
Yan Han, Associate Librarian
The University of Arizona Libraries
Phone: (520)307-2823
Email: h...@u.library.arizona.edu
From: Cindy Harper [mailto:char...@colgate.edu]
Sent: Monday, March 07, 2011 11:18 AM
To: Code for Libraries
Cc: Han, Yan
Subject: Re: [CODE4LIB] LAMP Hosting service
You can just buy a node from a variety of cloud providers such as Amazon EC2,
Linode etc. (It is very easy to build anything you want).
Yan
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Cindy
Harper
Sent: Sunday, March 06, 2011 10:54 AM
To
DSpace does Full-text search, you need to turn on the configuration file.
See UAL http://arizona.openrepository.com/arizona/
Yan
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Deng,
Sai
Sent: Wednesday, October 20, 2010 2:14 PM
To: CODE4LIB@LI
I would think DSpace, Fedora, and Eprint. DSpace is fairly easy to implement,
which has embargo support in 1.6
(https://wiki.duraspace.org/display/DSTEST/Embargo ).
I have an article comparing DSpace and Fedora, but was written 6 years ago.
DSpace has not been changed much, but Fedora is a diffe
Hello,
Currently we would like to have Amazon EC2 node hosting 2 applications: DSpace
and Koha (so that we need 4 ports). However, it seems to me that only port 80
and 8080 are available. Any other ports are not accessible from outside.
Anyone has similar experience and knows how to open other po
Hello, Colleagues,
Does anyone know/use any OCR software working on handwritten pages? or at least
think it is better than hiring a student key-in.
I know these OCR software such as ABBYY, but they do not work on handwriting.
Thanks,
Yan
ss
Singer
Sent: Wednesday, November 18, 2009 8:11 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Assigning DOI for local content
On Wed, Nov 18, 2009 at 12:19 PM, Han, Yan wrote:
> Currently DOI uses Handle (technology) with it social framework (i.e.
> administrative body to man
Currently DOI uses Handle (technology) with it social framework (i.e.
administrative body to manage DOI). In technical sense, PURL is not going to
last long.
Crossref handles DOI registration in U.S. In Europe and Aisa, they have other
organizations to handle it. DOI is also currently going thr
There are two things about archive images at least I can think of this moment:
1. the resolution: diff size/materials require different resolution. There is
no one-size-fit-all. To make a judgment, I would like to know the image
(color?), the size of the material?,
2. the file format: TIFF is th
The National Archives has the guideline which describes target that you
can use for scanning comparison. There are other targets used in other
books/articles.
I suggest that you check the National Archives' guidelines.
http://www.archives.gov/preservation/technical/guidelines.html
-Original
That is right.
In addition, for certain printing (gold seal), digital camera delivers better
result than scanners.
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
Jonathan Rochkind
Sent: Friday, May 01, 2009 2:38 PM
To: CODE4LIB@LISTSERV.ND.ED
Bill and Peter,
Very nice posts. XML, RDF, MARC and DC are all different ways to present
information in a way (of course, XML, RDF, and DC are easier to read/processed
by machine).
However, down the fundamentals, I think that it can go deeper, basically data
structure and algorithms making th
Well, the future of ILS is to use general computing standards without
making library's own.
Essentially, from a computing theory view, a graph is the way to present
all the info (i.e. a graph can represent a tree, or a line. When you
look at MARC, it is a linear computing model.) Graph is powerfu
Mark,
Many thanks for your input. This is one of the packages that I am thinking of.
Good to know its accuracy.
Yan
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Mark
Jordan
Sent: Tuesday, February 03, 2009 5:36 PM
To: CODE4LIB@LISTSERV.ND.
Hello,
Do you know an OCR engine for Persian/Dari ? If so, what is the accurate
rate?
Thanks,
Yan
Hello,
Do you know a tool running under Linux to make PDFs from images? I use
Adobe Acrobat professional in Windows to create PDFs from image files.
However, Acrobat does not handle image files with east Asian characters.
Yan
try PDFBox. It can index PDF documents.
-Original Message-
From: Code for Libraries on behalf of Thomas Dowling
Sent: Wed 1/28/2009 2:37 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and
determine if each one will open? (eom)
I clicked 2 URLs, and they are broken. What happened?
"404 Not Found
There is no SKOS Concept, ConceptScheme, or Collection instance in the
registry available using this resource URI."
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
Tim Cornwel
Hello, All,
As mentioned before, I have received quite a few inquiries about the
packages. I have created a web page so that you can download them. I
have also made some fixes on the package. The software package does:
* Unzip ProQuest/UMI ETD delivery Zipped files, and create one
dir
Hello, Colleagues,
As ProQuest/UMI switched its delivery platform for Electronic Theses and
dissertations(ETD), I have developed a small software package to process ETD.
The software package does:
1. Unzip ProQuest/UMI ETD delivery Zipped files, and create one directory
per ETD.
2.
31 matches
Mail list logo