#858: Inspire: eprint search
----------------------------------+------------------------
Reporter: annetteh | Owner:
Type: defect | Status: new
Priority: major | Component: WebSearch
Version: | Resolution:
Keywords: Inspire eprint search |
----------------------------------+------------------------
Comment (by hoc):
Should eprints be handled this way (in indexing and search), simply boil
them down to their essential components, [a-z0-9]:
Remove all non-[a-z0-9] and then index as both [a-z]+\d{7} and \d{7}
So user can write the eprint any way in a search "find eprint..."
hep-th/9204057, hep-th 9204057, hepth/9204057 --> hepth9204057
as well just searching on 9204057 will find it (and maybe some others)
for eprints of the form arXiv:1202.1256 remove all \D, so
arXiv:1202.1256 -> 12021256
then searching "find eprint ..."
1202.1256, arxiv:1202.1256, arxiv/1202.1256, 1202/1256, etc will find it.
--
Ticket URL: <http://invenio-software.org/ticket/858#comment:1>
Invenio <http://invenio-software.org>