Hi,

Im writing a simple feed crawler using tasks. Feed entries are stored
in Entry model. I use guids to check whether an entry exists or not,
to avoid entries from begin crawled more than once.. But my checking
mechanism seems to be failing..

Here is the check function and Entry model that i use..

class Entry(db.Model):
        guid = db.StringProperty()
        title = db.TextProperty()

        @classmethod
        def exists(cls, guid):
                query = db.Query(cls, keys_only = True)
                key = query.filter('guid =', guid).get()
                if key:
                        logging.info('True : %s' % guid)
                        return True
                else:
                        logging.info('False : %s' % guid)
                        return False

and i get strange logs like this

I 12-18 11:29AM 06.720 True : http://www.techcrunch.com/?p=129969
I 12-18 11:29AM 06.721 False : http://www.techcrunch.com/?p=129937
I 12-18 11:29AM 06.722 False : http://www.techcrunch.com/?p=129963
I 12-18 11:29AM 06.723 False : http://www.techcrunch.com/?p=129883
.... all other entries are False ....


I 12-18 11:19AM 00.745 True : http://www.techcrunch.com/?p=129969
I 12-18 11:19AM 00.754 True : http://www.techcrunch.com/?p=129937
I 12-18 11:19AM 00.763 True : http://www.techcrunch.com/?p=129963
I 12-18 11:19AM 00.770 True : http://www.techcrunch.com/?p=129883
.... all other entries are True ....

am i doing something wrong? .. how is it possible that the same query
that returned a key 10min ago return None now?

--

You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/google-appengine?hl=en.


Reply via email to