Re: Dataclasses, immutability(?), and ChatGPT

Thomas Passin Wed, 12 Apr 2023 05:30:08 -0700

On 4/12/2023 12:58 AM, dn via Python-list wrote:

Are dataclasses (or instances thereof) mutable or immutable?
- and in what sense?
Have been experimenting with ChatGPT. In particular: its possibilities as a trainer, good ideas for methods of introducing new topics, its capability for drawing-up demonstrations or examples, its interpretation of assessment questions (ie evaluating my own work, eg do they test desired specifics, is it possible/likely to misinterpret the question, ...)
Conversed with ChatGPT about the differences between (standard) custom-classes and dataclasses, and when one might prefer one over the other.
We(?) started discussing how dataclasses might be introduced to trainees currently only familiar with Python's built-ins: "...how they are superior to using built-in data-constructs..."
Amongst the four benefits ChatGPT listed was:
«
Immutable instances: By default, instances of dataclasses are immutable, which means that once created, their attributes cannot be modified. This can help to prevent unintended modifications to the data.
»
Huh? If we'd been discussing namedtuples over (say) dictionaries, I'd perhaps have accepted the reply.
The next step was to write some code to double-check my understanding (reproduced below). No change there.
(given that I'm perfect...cough, cough, splutter)


Thus, back to ChatGPT to seek clarification:
«
You said "Immutable instances: By default, instances of dataclasses are immutable, which means that once created, their attributes cannot be modified. This can help to prevent unintended modifications to the data." The attribute-values within a data-class can be changed. In what way are Python dataclasses immutable?
»

ChatGPT did indeed clarify that:
«
You're correct that the attribute values within a dataclass can be changed, even though the instance itself is considered immutable.
»
That's better. The machine is as smart as I am - or I, as smart as it - or something(?)
However, the supporting-explanation then appears to argue against its own clarification (and the assertion in the question):
«
So to clarify, when we say that instances of dataclasses are immutable, we mean that the attributes of an instance cannot be changed after it is created.
In other words, if you create an instance of a dataclass, its attributes cannot be modified by any external code. If you want to change the value of an attribute, you need to create a new instance of the dataclass with the updated attribute value.
... [code excised - see below]
This is because the instance is considered immutable, and its attributes cannot be changed directly. Instead, we need to create a new instance of the dataclass with the updated attribute value, like this...
Now, new_person is a new instance of the Person dataclass with the updated attribute values, while the original person instance remains unchanged.
So to summarize, while the attribute values within a dataclass can be changed, instances of dataclasses are considered immutable because their attributes cannot be modified directly once they are created. Any updates to the attributes require the creation of a new instance of the dataclass.
»

[Note]
In and amongst the above explanatory-paragraphs it gives example code-snippets. Those are condensed into:
 >>> from dataclasses import dataclass
 >>>
 >>> @dataclass
... class Person:
...     name: str
...     age: int
...
 >>> person = Person(name='Alice', age=20)
 >>>
 >>> person.name = 'Bob'  # raises AttributeError: can't set attribute
 >>> person.age = 21      # raises AttributeError: can't set attribute

- note the erroneous comments about exceptions being raised
(they weren't/aren't)!
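For comparison, the behaviour ChatGPT describes looks much more like a dataclass declared with frozen=True than like the default. The following is only a minimal sketch: FrozenPerson is a hypothetical name introduced here and is not part of the original exchange. It uses dataclasses.replace() to build the "new instance with the updated attribute value" that the explanation mentions.

```python
from dataclasses import FrozenInstanceError, dataclass, replace


@dataclass(frozen=True)
class FrozenPerson:
    """Hypothetical frozen counterpart of the Person example."""
    name: str
    age: int


person = FrozenPerson(name="Alice", age=20)

try:
    person.name = "Bob"      # blocked only because frozen=True was requested
    assignment_blocked = False
except FrozenInstanceError:  # a subclass of AttributeError
    assignment_blocked = True

# replace() returns a new instance; the original is left untouched.
new_person = replace(person, age=21)
print(assignment_blocked, person, new_person)
```

Without frozen=True the same assignment simply succeeds, which is the default behaviour under discussion.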
Here's a home-grown version illustrating all the points made during the investigation, by way of proof/disproof:
""" PythonTraining:dataclass.py
     Prove/disprove claim made by Chat-GPT.
"""

__author__ = "dn, IT&T Consultant"
__python__ = "3.11"
__created__ = "PyCharm, 12 Apr 2023"
__copyright__ = "Copyright © 2023~"
__license__ = "MIT"

# PSL
from dataclasses import dataclass


@dataclass
class Coordinates():
     """Sample dataclass. """
     x:int
     y:int


if __name__ == "__main__":
     print( "\nCommencing execution\n" )

     coordinates = Coordinates( 1, 2, )
     print( coordinates, id( coordinates ), )
     coordinates.x = 3
     print( coordinates, id( coordinates ), )
     coordinates.z = 4
     print( coordinates, id( coordinates ), )
     print( coordinates.x, coordinates.y, coordinates.z, )

     print( "\nTerminating" )


### output:
Commencing execution

Coordinates(x=1, y=2) 140436963150928
Coordinates(x=3, y=2) 140436963150928
Coordinates(x=3, y=2) 140436963150928
3 2 4

Terminating
###
Not only are a dataclass instance's attribute-values mutable, but further attributes can be dynamically-added to the object-instance!
Yes, if the code included:

coordinates = Coordinates( 5, 6, )
the new "coordinates" identifier would point to a different id() 'address', ie a fresh immutable-instance.
The 'book of words' (https://docs.python.org/3/library/dataclasses.html) does mention immutability (wrt dataclasses) in that it is possible to add a __hash__() method (any object defined with one is (technically) immutable). However, apart from the default_factory argument, there doesn't appear to be other discussion of [im]mutability.
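On the __hash__() point, a small sketch may help separate hashability from immutability. As far as the documented behaviour goes, a default dataclass (eq=True, frozen=False) has __hash__ set to None and is therefore unhashable, while eq=True together with frozen=True gets a generated __hash__(). MutablePoint and FrozenPoint are hypothetical names used only for this illustration.

```python
from dataclasses import dataclass


@dataclass
class MutablePoint:
    """Default settings: eq=True, frozen=False."""
    x: int
    y: int


@dataclass(frozen=True)
class FrozenPoint:
    """eq=True plus frozen=True, so a __hash__() is generated."""
    x: int
    y: int


try:
    hash(MutablePoint(1, 2))
    mutable_hashable = True
except TypeError:            # __hash__ is None on the default dataclass
    mutable_hashable = False

# Equal frozen instances hash equally, so they can serve as dict keys.
frozen_hashable = hash(FrozenPoint(1, 2)) == hash(FrozenPoint(1, 2))
print(mutable_hashable, frozen_hashable)
```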
Anything I've 'missed'?
- or a salutary tale of not depending upon ChatGPT etc?

People need to remember that ChatGPT-like systems put words together the way that many humans usually do. So what they emit usually sounds smooth and human-like. If it's code they emit, it will tend to seem plausible because lines of code are basically sentences, and learning how to construct plausible sentences is what these systems are built to do. That's **plausible**, not "logical" or "correct".

The vast size of these systems means that they can include a larger context in figuring out what words to place next compared with earlier, smaller systems.

But consider: what if you wrote code as a stream-of-consciousness process? That code might seem plausible, but why would you have any confidence in it? Or to put it another way, what if most of ChatGPT's exposure to code came from StackOverflow archives?

On top of that, ChatGPT-like systems do not know your requirements nor the reasons behind your requests. They only know that when other people put words and phrases together like you did, they tended to get responses that sound like what the chatbot emits next. It's basically cargo-culting its responses.

Apparently researchers have been learning that the more parameters that a system like this has, the more likely it is to learn how to emit responses that the questioner likes. Essentially, it could become the ultimate yes-man!

So there is some probability that the system will tell you interesting or useful things, some probability that it will try to tell you what it thinks you want to hear, some probability that it will tell you incorrect things that other people have repeated, and some probability that it will confabulate - simply make things up.

If I were going to write a novel about an alternate history, I think that a ChatGPT-like system would be a fantastic writing assistant. Code? Not so much.

--
https://mail.python.org/mailman/listinfo/python-list
