I'm running into an issue trying to upload data:


C:\Users\zablahre\Documents\AR\bulkload>appcfg.py upload_data
--config=bulkl
r.yaml --filename=quiz_load.dat --kind=Quiz
--url=http://myarquiz.appspot.co
mote_api
Uploading data records.
[INFO ] Logging to bulkloader-log-20101120.114440
[INFO ] Throttling transfers:
[INFO ] Bandwidth: 250000 bytes/second
[INFO ] HTTP connections: 8/second
[INFO ] Entities inserted/fetched/modified: 20/second
[INFO ] Batch Size: 10
[INFO ] Opening database: bulkloader-progress-20101120.114440.sql3
[INFO ] Connecting to myarquiz.appspot.com/remote_api
Please enter login credentials for myarquiz.appspot.com
Email:
Password for :
[INFO ] Starting import; maximum 10 entities per post
[ERROR ] [Thread-12] DataSourceThread:
Traceback (most recent call last):
File "C:\Program
Files\Google\google_appengine\google\appengine\tools\bulk
er.py", line 1448, in run
self.PerformWork()
File "C:\Program
Files\Google\google_appengine\google\appengine\tools\bulk
er.py", line 1564, in PerformWork
for item in content_gen.Batches():
File "C:\Program
Files\Google\google_appengine\google\appengine\tools\bulk
er.py", line 487, in Batches
self._ReadRows(key_start, key_end)
File "C:\Program
Files\Google\google_appengine\google\appengine\tools\bulk
er.py", line 409, in _ReadRows
row = self.reader.next()
File "C:\Program
Files\Google\google_appengine\google\appengine\ext\bulklo
sv_connector.py", line 186, in generate_import_record
for input_dict in self.dict_generator:
File "C:\Python27\lib\csv.py", line 104, in next
row = self.reader.next()
File "C:\Program
Files\Google\google_appengine\google\appengine\ext\bulklo
sv_connector.py", line 42, in utf8_recoder
for line in codecs.getreader(encoding)(stream):
File "C:\Python27\lib\codecs.py", line 615, in next
line = self.readline()
File "C:\Python27\lib\codecs.py", line 530, in readline
data = self.read(readsize, firstline=True)
File "C:\Python27\lib\codecs.py", line 477, in read
newchars, decodedbytes = self.decode(data, self.errors)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xe1 in position 0: invalid continuation byte
[INFO ] An error occurred. Shutting down...
.
[ERROR ] Error in data source thread: 'utf8' codec can't decode byte 0xe1 in position 0: invalid continuation byte
[INFO ] 0 entites total, 0 previously transferred
[INFO ] 10 entities (9046 bytes) transferred in 37.4 seconds
[INFO ] Some entities not successfully transferred



I need help troubleshooting this one. I'm trying to pinpoint which
record causes the error. I have tried a couple of options like
-noisy and -verbose, but they do not give any more information than
what I've posted here.
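
A minimal sketch of one way to locate such a record outside the bulkloader, assuming the file is expected to be UTF-8 (the codec named in the traceback). The function name `find_undecodable_lines` is made up for illustration, and the code is written for current Python 3 rather than the Python 2.7 shown in the traceback. It reports physical line numbers, so a quoted field containing a line break would span more than one reported line.

```python
import io


def find_undecodable_lines(stream, encoding="utf-8"):
    """Return (line_number, raw_bytes) for each line that fails to decode.

    `stream` must be a binary file-like object, e.g. open(path, "rb").
    Line numbers are 1-based, so line 1 is the CSV header row.
    """
    bad = []
    for number, raw in enumerate(stream, start=1):
        try:
            raw.decode(encoding)
        except UnicodeDecodeError:
            bad.append((number, raw))
    return bad


# Tiny in-memory demonstration with an invented row containing byte 0xe1,
# which is not valid UTF-8 when followed by an ASCII letter.
demo = io.BytesIO(b"voice,title\n-,\xe1bc\n+,ok\n")
demo_result = find_undecodable_lines(demo)
```

To run it against the real file, something like `find_undecodable_lines(open("quiz_load.dat", "rb"))` should list the offending lines together with their raw bytes.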






The data looks like this...



voice,quizno,booklevel,language,author,title,fiction,pointvalue,intlevel
-,49292,1.2,English,Caroline Arnold,"Sleepytime for Zoo Animals",Nonfiction,0.5,LG
+,7243,1.2,English,Harriet Ziefert,"So Sick!",Fiction,0.5,LG
-,79536,1.2,English,Margaret Hall,"Spiders",Nonfiction,0.5,LG
+,7293,1.2,English,Harriet Ziefert,"Take My Picture!",Fiction,0.5,LG
-,55923,1.2,English,Shen Roddie,"Toes Are to Tickle",Fiction,0.5,LG
-,50144,1.2,English,Dana Meachen Rau,"Uncle's Bakery",Fiction,0.5,LG
-,51274,1.2,English,Katie Davis,"Who Hops.",Fiction,0.5,LG
+,31630,1.2,English,Keith Baker,"Who Is the Beast.",Fiction,0.5,LG
....There are over 6000 rows....


FYI, I enclosed the titles in quotes because some of them contain
commas, and the fields are separated by commas.
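
As a quick sanity check on the quoting, here is a small sketch showing that Python's `csv` module keeps a quoted title with embedded commas as a single field. The sample row is invented (its author, quiz number, and title are not from the real data), `parse_rows` is a hypothetical helper name, and the code targets Python 3.

```python
import csv
import io


def parse_rows(text):
    """Parse CSV text that starts with a header row into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


SAMPLE = (
    "voice,quizno,booklevel,language,author,title,fiction,pointvalue,intlevel\n"
    # Hypothetical row: the title is made up to show embedded commas.
    '+,0,1.2,English,Example Author,"One, Two, Three",Fiction,0.5,LG\n'
)

rows = parse_rows(SAMPLE)
print(rows[0]["title"])
```

If the quoting were missing, the extra commas would shift the later columns, so checking that every row has exactly nine fields is a cheap way to spot a malformed line.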


I would appreciate any suggestions on what to try.


bulkloader.yaml:
===========

# Autogenerated bulkloader.yaml file.
# You must edit this file before using it. TODO: Remove this line when done.
# At a minimum address the items marked with TODO:
# * Fill in connector and connector_options
# * Review the property_map.
# - Ensure the 'external_name' matches the name of your CSV column,
# XML tag, etc.
# - Check that __key__ property is what you want. Its value will become
# the key name on import, and on export the value will be the Key
# object. If you would like automatic key generation on import and
# omitting the key on export, you can remove the entire __key__
# property from the property map.


# If you have module(s) with your model classes, add them here. Also
# change the kind properties to model_class.
python_preamble:
- import: base64
- import: re
- import: google.appengine.ext.bulkload.transform
- import: google.appengine.ext.bulkload.bulkloader_wizard
- import: google.appengine.ext.db
- import: google.appengine.api.datastore
- import: google.appengine.api.users


transformers:


- kind: Quiz
  connector: csv # TODO: Choose a connector here: csv, simplexml, etc...
  connector_options:
    # TODO: Add connector options here--these are specific to each connector.
  property_map:
    - property: __key__
      external_name: key
      export_transform: transform.key_id_or_name_as_string

    - property: author
      external_name: author
      # Type: String Stats: 1 properties of this type in this kind.

    - property: booklevel
      external_name: booklevel
      # Type: String Stats: 1 properties of this type in this kind.

    - property: fiction
      external_name: fiction
      # Type: String Stats: 1 properties of this type in this kind.

    - property: intlevel
      external_name: intlevel
      # Type: String Stats: 1 properties of this type in this kind.

    - property: language
      external_name: language
      # Type: String Stats: 1 properties of this type in this kind.

    - property: pointvalue
      external_name: pointvalue
      # Type: String Stats: 1 properties of this type in this kind.

    - property: quizno
      external_name: quizno
      # Type: String Stats: 1 properties of this type in this kind.

    - property: title
      external_name: title
      # Type: String Stats: 1 properties of this type in this kind.

    - property: voice
      external_name: voice
      # Type: String Stats: 1 properties of this type in this kind.
