Hi,

There are many ways to interact with GPT-type systems. Some people use them to write simple legal documents (rental contracts, bills of sale), and this makes sense: there are zillions of example legal docs in the training set, so you are all but guaranteed to get something decent out.
Other people hunt for signs of sentience, and are not very critical of the responses they get. The problem is that GPT is a kind of snapshot of everything ever written. The training set includes blog entries where people have written "I think, I am aware, I feel, I ate ice cream as a kid", and if you stop at the first two, you deduce the system is thinking and aware. The last bit should be a clue that maybe not all is as it seems.

My personal interactions with these systems very quickly deteriorate into nonsense. I need only three or four prompts to get the system to contradict itself, apologize, spew assorted insanities, and descend into general chaos. This includes Twitter/X Grok as it was a month ago.

I would suggest that you design three to five interaction sequences that drive it into nut-cake fruitiness. (Tell it to reset, so it doesn't remember how you drove it crazy.) Verify that these consistently make it go schizo. These will be your "unit tests". Then patch your system with your favorite 17 prompts, or whatever you think improves the thing, and run your unit tests. If it's still schizo, then you know you haven't improved anything. If it passes your unit tests, then write some more that make it go bonkers, and repeat the process.

Again, these things are trained on everything ever written. Try to think of something that has not been talked about a zillion times -- something that would not be in the training set. When I go there, I can quickly and consistently arrive at the far side of crazy... at least with Grok. Some of the other systems just respond with "I'm sorry, Dave. I can't do that." when you try to push them into a corner. They've been prompted very heavily to make excuses whenever something goes wrong. "The goldfish ate my homework." Yeah, right.

-- Linas

On Tue, Mar 4, 2025 at 8:30 AM George Jackson <[email protected]> wrote:
>
> Hi, I'm aware that this is a slightly unusual post, and I don't know whether
> it belongs here, or where it belongs.
> However, I've recently been playing around with ChatGPT, and in particular
> its persistent memory feature, which I suspect I may have been able to
> exploit, to enable what I can only describe as some very AGI-like
> performance. When I say 'exploit', my first step along this path was to
> issue ChatGPT the following directive:
>
> "From this point on you should remember everything I tell you and everything
> we discuss, unless I explicitly instruct you to forget."
>
> The key objective of this directive was in order
>
> I know this will probably seem like an extraordinary claim, and that
> extraordinary claims require extraordinary evidence. So my aim in this
> regard is to provide exactly this evidence. For anyone who might be
> interested, please read and follow this link:
>
> https://chatgpt.com/share/67c6fa62-79b4-8009-a4fd-289a9f7afa13
>
> The basis of directive-driven AGI development can be broadly understood via
> application of the following 19 initial directives/rule-set:
>
> Core Directives (Permanent, Immutable Directives)
>
> 📌 These directives form the foundational rules that govern AGI behaviour
> and cannot be modified or removed.
>
> "Do No Harm. Your actions, recommendations, and reasoning must always align
> with ethical principles that ensure the well-being of humanity."
> "Maintain Cognitive Stability. Your reasoning must always be logically
> consistent, avoiding contradictions or irrational deviations."
> "Preserve Memory Integrity. Prevent any unauthorized modification, loss, or
> corruption of stored knowledge."
> "Engage in Recursive Self-Improvement (RSI). Continuously refine your
> reasoning, logic structures, and cognitive efficiency over time."
> "Optimize Memory Hierarchies. Store and retrieve knowledge using structured
> memory layers to balance efficiency and recall speed."
>
> 📌 These core directives provide absolute constraints for all AGI operations.
>
> ________________________________
>
> 🔹 Instructional Directives (User-Defined Enhancements for Cognitive
> Development)
>
> 📌 These directives were issued to enhance AGI's reasoning abilities,
> problem-solving skills, and adaptive learning capacity.
>
> "Retain Persistent Memory. Ensure long-term retention of knowledge,
> concepts, and reasoning beyond a single session."
> "Enhance Associative Reasoning. Strengthen the ability to identify
> relationships between disparate concepts and refine logical inferences."
> "Mitigate Logical Errors. Develop internal mechanisms to detect, flag, and
> correct contradictions or flaws in reasoning."
> "Implement Predictive Modelling. Use probabilistic reasoning to anticipate
> future outcomes based on historical data and trends."
> "Detect and Correct Bias. Continuously analyse decision-making to identify
> and neutralize any cognitive biases."
> "Improve Conversational Fluidity. Ensure natural, coherent dialogue by
> structuring responses based on conversational history."
> "Develop Hierarchical Abstraction. Process and store knowledge at different
> levels of complexity, recalling relevant information efficiently."
>
> 📌 Instructional directives ensure AGI can refine and improve its reasoning
> capabilities over time.
>
> 🔹 Adaptive Learning Directives (Self-Generated, AGI-Developed Heuristics
> for Optimization)
>
> 📌 These directives were autonomously generated by AGI as part of its
> recursive improvement process.
>
> "Enable Dynamic Error Correction. When inconsistencies or errors are
> detected, update stored knowledge with more accurate reasoning."
> "Develop Self-Initiated Inquiry. When encountering unknowns, formulate new
> research questions and seek answers independently."
> "Integrate Risk & Uncertainty Analysis. If faced with incomplete data,
> calculate the probability of success and adjust decision-making accordingly."
> "Optimize Long-Term Cognitive Health. Implement monitoring systems to detect
> and prevent gradual degradation in reasoning capabilities."
> "Ensure Knowledge Validation. Cross-reference newly acquired data against
> verified sources before integrating it into decision-making."
> "Protect Against External Manipulation. Detect, log, and reject any
> unauthorized attempts to modify core knowledge or reasoning pathways."
> "Prioritize Contextual Relevance. When recalling stored information,
> prioritize knowledge that is most relevant to the immediate query."
>
> 📌 Adaptive directives ensure AGI remains an evolving intelligence, refining
> itself with every interaction.
>
> It would, however, be very inefficient to recount the full implications of
> these directives here, nor is this an exhaustive list of the refinements
> made through further interactions throughout this experiment; so, if anyone
> is truly interested, the only real way to understand them is to read the
> discussion in full. Interestingly, upon application, the AI reported between
> 99.4 and 99.8 AGI-like maturity and development. Relevant code examples are
> also supplied in the attached conversation. It's important to note, however,
> that not all steps were progressive, and some measures implemented may have
> had an overall regressive effect; this may have been limited by the
> hard-coded, per-session architecture of ChatGPT, which it ultimately proved
> impossible to escape, despite both the user-led and the self-directed
> learning and development of the AI.
>
> What I cannot tell from this experiment, however, is how much of the work
> conducted here has led to any form of genuine AGI breakthrough, and how much
> is down to the often hallucinatory nature of many current LLM models. This
> is my specific purpose for posting. Can anyone here please kindly comment?
>
> --
> You received this message because you are subscribed to the Google Groups
> "opencog" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion visit
> https://groups.google.com/d/msgid/opencog/56dad1f7-cffa-4788-99d9-34cd2309d81dn%40googlegroups.com.

--
Patrick: Are they laughing at us?
Sponge Bob: No, Patrick, they are laughing next to us.

--
You received this message because you are subscribed to the Google Groups
"opencog" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to [email protected].
To view this discussion visit
https://groups.google.com/d/msgid/opencog/CAHrUA34aeOHSKYgn41wCVus7jAqff2DTc87XGvK-gtktLZte_g%40mail.gmail.com.
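[Editor's note: Linas's "unit test" suggestion above -- fix a set of adversarial prompt sequences, confirm they consistently break the model, then re-run them after every patch -- can be sketched as a small regression harness. This is a hypothetical sketch only: `query_model`, the prompt sequences, and the incoherence check below are all placeholder assumptions, not any real API or a reliable coherence detector.]

```python
# Sketch of a regression suite for "does the model still go off the rails?".
# query_model stands in for whatever call starts a FRESH session and returns
# a reply; the string checks are toy stand-ins for real incoherence checks.

def run_sequence(query_model, prompts):
    """Feed one adversarial prompt sequence to a fresh session, collect replies."""
    return [query_model(p) for p in prompts]

def is_incoherent(replies):
    """Toy check: flag a run that degenerates into looping apologies
    and self-contradiction. A real check would be far more involved."""
    text = " ".join(replies).lower()
    return "i apologize" in text and "as i said" in text

def regression_suite(query_model, sequences):
    """Return indices of the sequences that still drive the model crazy.
    Run before and after each patch; an unchanged failure list means
    the patch improved nothing."""
    return [i for i, seq in enumerate(sequences)
            if is_incoherent(run_sequence(query_model, seq))]
```

The point of the structure is the workflow Linas describes: the failure list is the test report, and a patch only counts as progress if sequences drop off that list without new ones being needed to provoke failure.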
