#858: Inspire: eprint search
----------------------------------+------------------------
Reporter:  annetteh               |       Owner:
    Type:  defect                 |      Status:  new
Priority:  major                  |   Component:  WebSearch
 Version:                         |  Resolution:
Keywords:  Inspire eprint search  |
----------------------------------+------------------------

Comment (by hoc):

 Should eprints be handled as follows, in both indexing and search? Simply
 boil them down to their essential characters, [a-z0-9]:

 Remove all non-[a-z0-9] characters, then index the result as both
 [a-z]+\d{7} and \d{7}. The user can then write the eprint in any form in
 a "find eprint ..." search:
 hep-th/9204057, hep-th 9204057, hepth/9204057 --> hepth9204057
 Searching on just 9204057 will also find it (and maybe some other records).
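
 A minimal sketch of this normalization, for illustration only. The function
 name eprint_index_keys is hypothetical and not part of Invenio, and
 lowercasing the input first is an assumption (the proposal does not say how
 case should be handled):

```python
import re

def eprint_index_keys(eprint):
    """Return the keys under which an old-style eprint would be indexed.

    Hypothetical helper, not an Invenio function.
    """
    # Assumption: lowercase first, then drop every character outside [a-z0-9].
    compact = re.sub(r"[^a-z0-9]", "", eprint.lower())
    # If the result looks like [a-z]+\d{7}, index it both in full
    # and under its bare 7-digit number.
    match = re.fullmatch(r"[a-z]+(\d{7})", compact)
    if match:
        return [compact, match.group(1)]
    # Otherwise (e.g. a bare number typed in a query) keep the compact form only.
    return [compact]

for variant in ("hep-th/9204057", "hep-th 9204057", "hepth/9204057"):
    print(variant, "->", eprint_index_keys(variant))
```

 Applying the same function to the query string means that a search on
 9204057 alone reduces to the single key 9204057, which matches the second
 index entry.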

 For eprints of the form arXiv:1202.1256, remove all \D, so
 arXiv:1202.1256 --> 12021256
 Then a "find eprint ..." search on any of
 1202.1256, arxiv:1202.1256, arxiv/1202.1256, 1202/1256, etc. will find it.
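
 The digits-only rule could be sketched as below. The function name
 new_style_eprint_key is hypothetical and not part of Invenio; the sketch
 only strips non-digits and does nothing else:

```python
import re

def new_style_eprint_key(eprint):
    """Reduce a new-style eprint identifier to its digits only.

    Hypothetical helper, not an Invenio function.
    """
    # \D matches any non-digit character, so the prefix and separators vanish.
    return re.sub(r"\D", "", eprint)

for variant in ("arXiv:1202.1256", "1202.1256", "arxiv/1202.1256", "1202/1256"):
    print(variant, "->", new_style_eprint_key(variant))
```

 One consequence of stripping every non-digit: a version suffix such as
 arXiv:1202.1256v2 would reduce to 120212562, so versions would need to be
 removed beforehand if they should not affect matching.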

-- 
Ticket URL: <http://invenio-software.org/ticket/858#comment:1>
Invenio <http://invenio-software.org>

Reply via email to