I'm running into an issue trying to upload data:
C:\Users\zablahre\Documents\AR\bulkload>appcfg.py upload_data --config=bulkl r.yaml --filename=quiz_load.dat --kind=Quiz --url=http://myarquiz.appspot.co mote_api Uploading data records. [INFO ] Logging to bulkloader-log-20101120.114440 [INFO ] Throttling transfers: [INFO ] Bandwidth: 250000 bytes/second [INFO ] HTTP connections: 8/second [INFO ] Entities inserted/fetched/modified: 20/second [INFO ] Batch Size: 10 [INFO ] Opening database: bulkloader-progress-20101120.114440.sql3 [INFO ] Connecting to myarquiz.appspot.com/remote_api Please enter login credentials for myarquiz.appspot.com Email: Password for : [INFO ] Starting import; maximum 10 entities per post [ERROR ] [Thread-12] DataSourceThread: Traceback (most recent call last): File "C:\Program Files\Google\google_appengine\google\appengine\tools\bulk er.py", line 1448, in run self.PerformWork() File "C:\Program Files\Google\google_appengine\google\appengine\tools\bulk er.py", line 1564, in PerformWork for item in content_gen.Batches(): File "C:\Program Files\Google\google_appengine\google\appengine\tools\bulk er.py", line 487, in Batches self._ReadRows(key_start, key_end) File "C:\Program Files\Google\google_appengine\google\appengine\tools\bulk er.py", line 409, in _ReadRows row = self.reader.next() File "C:\Program Files\Google\google_appengine\google\appengine\ext\bulklo sv_connector.py", line 186, in generate_import_record for input_dict in self.dict_generator: File "C:\Python27\lib\csv.py", line 104, in next row = self.reader.next() File "C:\Program Files\Google\google_appengine\google\appengine\ext\bulklo sv_connector.py", line 42, in utf8_recoder for line in codecs.getreader(encoding)(stream): File "C:\Python27\lib\codecs.py", line 615, in next line = self.readline() File "C:\Python27\lib\codecs.py", line 530, in readline data = self.read(readsize, firstline=True) File "C:\Python27\lib\codecs.py", line 477, in read newchars, decodedbytes = self.decode(data, self.errors) UnicodeDecodeError: 'utf8' codec can't decode byte 0xe1 in position 0: inval ontinuation byte [INFO ] An error occurred. Shutting down... . [ERROR ] Error in data source thread: 'utf8' codec can't decode byte 0xe1 osition 0: invalid continuation byte [INFO ] 0 entites total, 0 previously transferred [INFO ] 10 entities (9046 bytes) transferred in 37.4 seconds [INFO ] Some entities not successfully transferred I need help trying to troubleshoot this one.. I'm trying to point out which record creates this issue. I have tried a couple of option like -noisy and -verbose but do not get any more info than what I'm posting today. The data looks like this... voice,quizno,booklevel,language,author,title,fiction,pointvalue,intlevel -,49292,1.2,English,Caroline Arnold,"Sleepytime for Zoo Animals",Nonfiction,0.5,LG +,7243,1.2,English,Harriet Ziefert,"So Sick!",Fiction,0.5,LG -,79536,1.2,English,Margaret Hall,"Spiders",Nonfiction,0.5,LG +,7293,1.2,English,Harriet Ziefert,"Take My Picture!",Fiction,0.5,LG -,55923,1.2,English,Shen Roddie,"Toes Are to Tickle",Fiction,0.5,LG -,50144,1.2,English,Dana Meachen Rau,"Uncle's Bakery",Fiction,0.5,LG -,51274,1.2,English,Katie Davis,"Who Hops.",Fiction,0.5,LG +,31630,1.2,English,Keith Baker,"Who Is the Beast.",Fiction,0.5,LG ....There are over 6000 rows.... FYI I enclosed the title in quotes because some of them contain commas and since the data is separated by commas... Would appreciate anything I can try bulkloader.yaml: =========== # Autogenerated bulkloader.yaml file. # You must edit this file before using it. TODO: Remove this line when done. # At a minimum address the items marked with TODO: # * Fill in connector and connector_options # * Review the property_map. # - Ensure the 'external_name' matches the name of your CSV column, # XML tag, etc. # - Check that __key__ property is what you want. Its value will become # the key name on import, and on export the value will be the Key # object. If you would like automatic key generation on import and # omitting the key on export, you can remove the entire __key__ # property from the property map. # If you have module(s) with your model classes, add them here. Also # change the kind properties to model_class. python_preamble: - import: base64 - import: re - import: google.appengine.ext.bulkload.transform - import: google.appengine.ext.bulkload.bulkloader_wizard - import: google.appengine.ext.db - import: google.appengine.api.datastore - import: google.appengine.api.users transformers: - kind: Quiz connector: csv # TODO: Choose a connector here: csv, simplexml, etc... connector_options: # TODO: Add connector options here--these are specific to each connector. property_map: - property: __key__ external_name: key export_transform: transform.key_id_or_name_as_string - property: author external_name: author # Type: String Stats: 1 properties of this type in this kind. - property: booklevel external_name: booklevel # Type: String Stats: 1 properties of this type in this kind. - property: fiction external_name: fiction # Type: String Stats: 1 properties of this type in this kind. - property: intlevel external_name: intlevel # Type: String Stats: 1 properties of this type in this kind. - property: language external_name: language # Type: String Stats: 1 properties of this type in this kind. - property: pointvalue external_name: pointvalue # Type: String Stats: 1 properties of this type in this kind. - property: quizno external_name: quizno # Type: String Stats: 1 properties of this type in this kind. - property: title external_name: title # Type: String Stats: 1 properties of this type in this kind. - property: voice external_name: voice # Type: String Stats: 1 properties of this type in this kind. -- You received this message because you are subscribed to the Google Groups "Google App Engine for Java" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/google-appengine-java?hl=en.
