Re: [python-uk] memoize & ordering of kwargs.items()

Jonathan Fri, 11 Nov 2011 01:23:58 -0800

That's good to know René, but I *think* it's orthogonal to the question.Please correct me if I'm wrong.

If PyPy returns items in a different order than CPython, that doesn'tmatter to me, so long as every invocation of my function in a particularprocess ends up receiving a particular order, that doesn't change untilthe process ends (or the cache is cleared.)

The dict subclass is a great idea - but if the change in order isn'tever manifested by a regular dict, then it implies to me that the'sorted' call isn't actually required in real life, so the need for thiswhole test disappears.


See also my imminent reply to Ross.

    Jonathan


On 11/11/2011 08:49, René Dudfield wrote:

Good morning,

1. Run it on different versions of python, or on differentmachines/OSes. That usually results in different dictionary ordering.

2. create a dict subclass that returns items always randomised.

    class my_fun_randomising_item_dict_subclass(dict):
        def items(self):
            return random.shuffle(dict.items(self, key))
    def make_key(kwargs):

return (args,tuple(sorted(my_fun_randomising_item_dict_subclass(kwargs).items())))


cheers,

On Fri, Nov 11, 2011 at 9:42 AM, Jonathan <[email protected]<mailto:[email protected]>> wrote:


    Hey,

    I've been writing my own 'memoize' function a lot recently - I'm
    using it as an interview question.

    Here's what I've got (with a corresponding series of tests):

       def memoize(wrapped):

           cache = {}

           @wraps(wrapped)
           def wrapper(*args, **kwargs):
               key = (args, tuple(kwargs.items()))
               if key not in cache:
                   cache[key] = wrapped(*args, **kwargs)
               return cache[key]

           return wrapper

    Yesterday a candidate pointed out that this is wrong because
    .items() for two equal dictionaries might return the (key,value)
    pairs in a different order, presumably dependent on the respective
    dictionary's history. This would produce a different 'key' and
    hence an erroneous cache miss.

    This implies that the second line of 'wrapper' should become:

                 key = (args, tuple(sorted(kwargs.items())))

    (I've added 'sorted')
    Looking at lru_cache, I see that is exactly how it is implemented
    there.
    http://hg.python.org/cpython/file/default/Lib/functools.py

    However, I'm unable to come up with a test that proves this is
    necessary. I'm can create two equal dictionaries which return
    their .items() in a different order:

           # The intent is that 'small.items()' comes out in a
    different order
           # than 'large.items()'
           small = {'x':1, 'y':5}
           large = {hex(i): i for i in range(257)}
           large.update(small)
           for i in range(257):
               del large[hex(i)]

    >>> print small.items()
       [('y', 5), ('x', 1)]
    >>> print large.items()
       [('x', 1), ('y', 5)]

    If I could magically transfer these dictionaries directly into the
    'kwargs' of wrapper, then I think I'd be done. However, I have to
    pass these dictionaries in to wrapper via the '**' mechanism.
    Printing 'kwargs.items()' within 'wrapper' shows that equal
    dictionaries always return their items in the same order, so the
    'sorted' call is apparently not necessary.

    Is 'sorted' really required? If so, how can I write a test to
    demonstrate it?

    Best regards,

       Jonathan

--Jonathan Hartley [email protected] <mailto:[email protected]>

    http://tartley.com

Made of meat. +44 7737 062 225 <tel:%2B44%207737%20062%20225>twitter/skype: tartley



    _______________________________________________
    python-uk mailing list
    [email protected] <mailto:[email protected]>
    http://mail.python.org/mailman/listinfo/python-uk




_______________________________________________
python-uk mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-uk


--
Jonathan Hartley    [email protected]    http://tartley.com
Made of meat.       +44 7737 062 225       twitter/skype: tartley

_______________________________________________
python-uk mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-uk

Re: [python-uk] memoize & ordering of kwargs.items()

Reply via email to