Jaya Naga Bhavana created SOLR-11666:
----------------------------------------
Summary: Solr Field with different language values
Key: SOLR-11666
URL: https://issues.apache.org/jira/browse/SOLR-11666
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Reporter: Jaya Naga Bhavana
*This is my python code:*
cmd = "curl localhost:8983/solr/" + core + "/update?commit=true -H
'Content-type:application/json' --data-binary " + "\"[{'id':'" +
getLastAddedDocumentID(
'id') + "','title':{'set':'" + title + "'},'author':{'set':'" +
authorNames + "'},'abstract':{'set':'" + abstract + "'}}]\""
print cmd
pp = subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
text, err = pp.communicate()
print text
*My variable cmd which contains Curl command to add the data to fields is as
below:*
curl localhost:8983/solr/test/update?commit=true -H
'Content-type:application/json' --data-binary
"[{'id':'15973569-229c-4ce1-83e2-4f5ba543386f','title':{'set':'Bi\-criteria\
Algorithm\ for\ Scheduling\ Jobs\ on\ Cluster\ Platforms\
\*'},'author':{'set':'Pierre\-François\ Dutot\;\ Lionel\ Eyraud\;\ Grégory\
Gr´\;\ Grégory\ Mouní\;\ Denis\ Trystram\;\ '},'abstract':{'set':'We\ describe\
in\ this\ paper\ a\ new\ method\ for\ building\ an\ efficient\ algorithm\ for\
scheduling\ jobs\ in\ a\ cluster.\ Jobs\ are\ considered\ as\ parallel\ tasks\
\(PT\)\ which\ can\ be\ scheduled\ on\ any\ number\ of\ processors.\ The\ main\
feature\ is\ to\ consider\ two\ criteria\ that\ are\ optimized\ together.\
These\ criteria\ are\ the\ makespan\ and\ the\ weighted\ minimal\ average\
completion\ time\ \(minsum\).\ They\ are\ chosen\ for\ their\ complementarity,\
to\ be\ able\ to\ represent\ both\ user\-oriented\ objectives\ and\ system\
administrator\ objectives.\ We\ propose\ an\ algorithm\ based\ on\ a\ batch\
policy\ with\ increasing\ batch\ sizes,\ with\ a\ smart\ selection\ of\ jobs\
in\ each\ batch.\ This\ algorithm\ is\ assessed\ by\ intensive\ simulation\
results,\ compared\ to\ a\ new\ lower\ bound\ \(obtained\ by\ a\ relaxation\
of\ ILP\)\ of\ the\ optimal\ schedules\ for\ both\ criteria\ separately.\ It\
is\ currently\ implemented\ in\ an\ actual\ real\-size\ cluster\ platform.'}}]"
*The field abstract is as below:*
<field name="abstract" type="string" docValues="true" indexed="true"
stored="true"/>
*The problem which I face when running this command is:*
Traceback (most recent call last):
File "F:/pyCalculation/uploadResearchPaper.py", line 196, in <module>
addDocument(pathToResearchPapersFolder + department + '/', query,
department)
File "F:/pyCalculation/uploadResearchPaper.py", line 188, in addDocument
pp = subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
File "C:\Python27\lib\subprocess.py", line 390, in __init__
errread, errwrite)
File "C:\Python27\lib\subprocess.py", line 610, in _execute_child
args = '{} /c "{}"'.format (comspec, args)
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe7' in position
267: ordinal not in range(128)
*And the line at position 276 is:*
'set':'Pierre\-François\ Dutot\;
The problem is with ç this character.
I am confused like Why doesn't solr allow to add this data to the field?
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]