#57: Invenio should identify itself through the user agent
------------------------+---------------------------------------------------
Reporter: bthiell | Owner:
Type: task | Status: new
Priority: minor | Milestone:
Component: BibBowling | Version:
Keywords: |
------------------------+---------------------------------------------------
When urllib is used to query other websites, we should modify the user-
agent in order to allow the remote server to identify the nature of the
connection. This can be useful also if operations on the remote server
have to be filtered using robots.txt.
Below is a list of potential candidates for the change:
bibcirculation_utils.py: cover_xml =
urllib.urlopen('http://ecs.amazonaws.com/onca/xml' \
bibcirculationadminlib.py: filexml =
urllib.urlopen('http://ecs.amazonaws.com/onca/xml?' \
bibcirculationadminlib.py: cover_xml =
urllib.urlopen('http://ecs.amazonaws.com/onca/xml' \
bibclassify_ontology_reader.py: url_desc = urllib2.urlopen(url)
bfe_issn.py: fields = urllib.urlopen(url.rstrip('/') +
request).readlines()
bibknowledge_regression_tests.py: response =
mechanize.urlopen("http://cdsware.cern.ch/download/invenio-demo-site-
files/HEP.rdf")
bibrankgkb.py: page = urllib.urlopen(data[0][link])
webjournaladminlib.py: css_file = urlopen(css_path)
bfe_webjournal_widget_seminars.py: indico_xml =
urlopen(indico_seminar_xml)
bfe_webjournal_widget_weather.py: feed =
urlopen(yahoo_weather_rss)
bibdocfile.py: info = urllib2.urlopen(url).info()
bibdocfile.py: urllib2.urlopen(url)
bibdocfile.py: from_file = urllib2.urlopen(url)
--
Ticket URL: <http://invenio-software.org/ticket/57>
Invenio <http://invenio-software.org>