#57: Invenio should identify itself through the user agent
------------------------+---------------------------------------------------
 Reporter:  bthiell     |       Owner:     
     Type:  task        |      Status:  new
 Priority:  minor       |   Milestone:     
Component:  BibBowling  |     Version:     
 Keywords:              |  
------------------------+---------------------------------------------------
 When urllib is used to query other websites, we should modify the user-
 agent in order to allow the remote server to identify the nature of the
 connection. This can be useful also if operations on the remote server
 have to be filtered using robots.txt.

 Below is a list of potential candidates for the change:

 bibcirculation_utils.py:    cover_xml =
 urllib.urlopen('http://ecs.amazonaws.com/onca/xml' \
 bibcirculationadminlib.py:        filexml =
 urllib.urlopen('http://ecs.amazonaws.com/onca/xml?' \
 bibcirculationadminlib.py:        cover_xml =
 urllib.urlopen('http://ecs.amazonaws.com/onca/xml' \
 bibclassify_ontology_reader.py:        url_desc = urllib2.urlopen(url)
 bfe_issn.py:        fields = urllib.urlopen(url.rstrip('/') +
 request).readlines()
 bibknowledge_regression_tests.py:        response =
 mechanize.urlopen("http://cdsware.cern.ch/download/invenio-demo-site-
 files/HEP.rdf")
 bibrankgkb.py:            page = urllib.urlopen(data[0][link])
 webjournaladminlib.py:    css_file = urlopen(css_path)
 bfe_webjournal_widget_seminars.py:            indico_xml =
 urlopen(indico_seminar_xml)
 bfe_webjournal_widget_weather.py:            feed =
 urlopen(yahoo_weather_rss)
 bibdocfile.py:        info = urllib2.urlopen(url).info()
 bibdocfile.py:            urllib2.urlopen(url)
 bibdocfile.py:                    from_file = urllib2.urlopen(url)

-- 
Ticket URL: <http://invenio-software.org/ticket/57>
Invenio <http://invenio-software.org>

Reply via email to