Re: The Singularity Institute Blog

Jason Resch Thu, 16 Jan 2014 06:58:55 -0800


On Jan 16, 2014, at 5:42 AM, Bruno Marchal <[email protected]> wrote:

On 16 Jan 2014, at 03:46, Jason Resch wrote:
On Tue, Jan 14, 2014 at 10:33 PM, meekerdb <[email protected]>wrote:A long, rambling but often interesting discussion among guys atMIRI about how to make an AI that is superintelligent but notdangerous (FAI=Friendly AI). Here's an amusing excerpt that startsat the bottom of page 30:Jacob: Can't you ask it questions about what is believes will betrue about the state of the world in 20 years?
Eliezer: Sure. You could be like, what color will the sky be in 20years? It would be like, “blue”, or it’ll say “In 20 yearsthere won't be a sky, the earth will have been consumed by nano machines,” and you're like, “why?” and the AI is like“Well, you know, you do that sort of thing.” “Why?” And thenthere’s a 20 page thing.
Dario: But once it says the earth is going to be consumed by nanomachines, and you're asking about the AI's set of plans,presumably, you reject this plan immediately and preferably changethe design of your AI.
Eliezer: The AI is like, “No, humans are going to do it.” Orthe AI is like, “well obviously, I'll be involved in the causal pathway but I’m not planning to do it.”
Dario: But this is a plan you don't want to execute.
Eliezer: All the plans seem to end up with the earth beingconsumed by nano-machines.
Luke: The problem is that we're trying to outsmart asuperintelligence and make sure that it's not tricking ussomehow subtly with their own language.
Dario: But while we're just asking questions we always have theability to just shut it off.
Eliezer: Right, but first you ask it “What happens if Ishut you off” and it says “The earth gets consumed by nanobotsin 19 years.”
I wonder if Bruno Marchal's theory might have something interestingto say about this problem - like proving that there is no way toensure "friendliness".
Brent
I think it is silly to try and engineer something exponentiallymore intelligent than us and believe we will be able to "control it".
Yes. It is close to a contradiction.
We only fake dreaming about intelligent machine, but once they willbe there we might very well be able to send them in goulag.
The real questions will be "are you OK your son or daughter marry amachine?".
Our only hope is that the correct ethical philosophy is to "treatothers how they wish to be treated".
Good. alas, many believe it is "to not treat others like *you* don'twant to be treated".
If there are such objectively true moral conclusions like that, andassuming that one is true, then we have little to worry about, forwith overwhelming probability the super-intelligent AI will arriveat the correct conclusion and its behavior will be guided by itsbeliefs. We cannot "program in" beliefs that are false, since if itis truly intelligent, it will know they are false.
I doubt we can really "program false belief" for a long time, butall machines can get false beliefs all the time.
Real intelligent machine will believe in santa klaus and fairytales, for a while. They will also search for easy and comfortingwishful sort of explanations.
Some may doubt there are universal moral truths, but I would arguethat there are.
OK. I agree with this, although they are very near inconsistencies,like "never do moral".
In the context of personal identity, if say, universalism is true,then "treat others how they wish to be treated" is an inevitableconclusion, for universalism says that others are self.
OK. I would use the negation instead: "don't treat others as theydon't want to be treated".
If not send me 10^100 $ (or €) on my bank account, because that is how I wish to be treated, right now.
:)

Bruno

LOL I see the distinction but can't it also be turned around? E.g., "Idon't want to be treated as though I'm not worth sending 10^100dollars to right now."


Jason

Jason


-------- Original Message --------

The Singularity Institute Blog

MIRI strategy conversation with Steinhardt, Karnofsky, and Amodei
Posted: 13 Jan 2014 11:22 PM PST
On October 27th, 2013, MIRI met with three additional members ofthe effective altruism community to discuss MIRI’s organizationalstrategy. The participants were:
Eliezer Yudkowsky (research fellow at MIRI)
Luke Muehlhauser (executive director at MIRI)
Holden Karnofsky (co-CEO at GiveWell)
Jacob Steinhardt (grad student in computer science at Stanford)
Dario Amodei (post-doc in biophysics at Stanford)
We recorded and transcribed much of the conversation, and thenedited and paraphrased the transcript forclarity, conciseness, and to protect the privacy of some content.The resulting edited transcript is available in full here.
Our conversation located some disagreements between theparticipants; these disagreements are summarized below. Thissummary is not meant to present arguments with all their force, butrather to serve as a guide to the reader for locating moreinformation about these disagreements. For each point, a pagenumber has been provided for the approximate start of that topic ofdiscussion in the transcript, along with a phrase that can besearched for in the text. In all cases, theparticipants would likely have quite a bit more to say on the topicif engaged in a discussion on that specific point.
Page 7, starting at “the difficulty is with context changes”:
Jacob: Statistical approaches can be very robust and need not relyon strong assumptions, and logical approaches are unlikely to scaleup to human-level AI.Eliezer: FAI will have to rely on lawful probabilistic reasoningcombined with a transparent utility function, rather than ourobserving that previously executed behaviors seemed ‘nice’ andtrying to apply statistical guarantees directly to that series ofsurface observations.
Page 10, starting at “a nice concrete example”
Eliezer: Consider an AI that optimizes for the number of smilingfaces rather than for human happiness, and thus tiles the universewith smiling faces. This example illustrates a class of failuremodes that are worrying.Jacob & Dario: This class of failure modesseems implausible to us.
Page 14, starting at “I think that as people want”:
Jacob: There isn’t a big difference between learning utility functions from a parameterized family vs. arbitrary utility functions.Eliezer: Unless ‘parameterized’ is Turing complete it would beextremely hard to write down a set of parameters such that human ‘right thing to do’ or CEV or even human selfish desires were within the hypothesis space.
Page 16, starting at “Sure, but some concepts are”:
Jacob, Holden, & Dario: “Is Terry Schiavo a person” is a naturalcategory.
Eliezer: “Is Terry Schiavo a person” is not a natural category.
Page 21, starting at “I would go between the two”:
Holden: Many of the most challenging problems relevant to FAI, ifin fact they turn out to be relevant, will be best solved at alater stage of technological development, when we have moreadvanced “tool-style” AI (possibly including AGI) in order toassist us with addressing these problems.Eliezer: Development may be faster and harder-to-control than wewould like; by the time our tools are much better we might not havethe time or ability to make progress before UFAI is an issue; andit’s not clear that we’ll be able to develop AIs that areextremely helpful for these problems while also being safe.Page 24, starting at “I think the difference in your mental models”:
Jacob & Dario: An “oracle-like” question-answering system isrelatively plausible.Eliezer: An “oracle-like” question-answering system is really hard.
Page 24, starting at “I don’t know how to build”:
Jacob: Pre-human-level AIs will not have a huge impact on thedevelopment of subsequent AIs.Eliezer: Building a very powerful AGI involves the AI carrying outgoal-directed (consequentialist) internal optimization on itself.
Page 27, starting at “The Oracle AI makes a”:
Jacob & Dario: It should not be too hard to examine the internalstate of an oracle AI.Eliezer: While AI progress can be eitherpragmatically or theoretically driven, internal state of theprogram is often opaque to humans at first and rendered partiallytransparent only later.
Page 38, starting at “And do you believe that within having”:
Eliezer: I’ve observed that novices who try to develop FAI concepts don’t seem to be self-critical at all or ask themselves what could go wrong with their bright ideas.Jacob & Holden: This is irrelevant to the question of whetheracademics are well-equipped to work on FAI, both because this isnot the case in more well-developed fields of research,and because attacking one’s own ideas is notnecessarily an integral part of the researchprocess compared to other important skills.
Page 40, starting at “That might be true, but something”:
Holden: The major FAI-related characteristic that academics lack iscause neutrality. If we can get academics to work on FAI despitethis, then we will have many good FAI researchers.Eliezer: Many different things are going wrong in the individualsand in academia which add up to a near-total absence of attempted— let alone successful — FAI research.
Page 53, starting at “I think the best path is to try”:
Holden & Dario: It’s relatively easy to get people to rally (withuseful action) behind safety issues.
Eliezer: No, it is hard.
Page 56, starting at “My response would be that’s the wrong thing”:
Jacob & Dario: How should we present problems to academics? AnEnglish-language description is sufficient; academics are trainedto formalize problems once they understand them.Eliezer: I treasure such miracles when somebody shows up who canperform them, but I don’t intend to rely on it and certainlydon’t think it’s the default case for academia. Hence I think interms of MIRI needing to crispify problems to the pointof being 80% or 50% solved before they can really be farmed out anywhere.This summary was produced by the following process: Jacob attempteda summary, and Eliezer felt that his viewpoint was poorly expressedon several points and wrote back with his proposed versions. Ratherthan try to find a summary both sides would be happy with, Jacobstuck with his original statements and included Eliezer’s responses mostly as-is, and Eliezer later edited them for clarity and conciseness. A Google Doc of the summary was then produced by Luke andshared with all participants, with Luke bringing up several points for clarification with each of the other participants. A couplepoints in the summary were also removed because it was difficult to find consensus about their phrasing. The summary was published once all participants were happy with the Google Doc.
The post MIRI strategy conversation with Steinhardt, Karnofsky, andAmodei appeared first on Machine Intelligence Research Institute.
You are subscribed to email updates from Machine IntelligenceResearch Institute » BlogTo stop receiving these emails, you may unsubscribe now. Emaildelivery powered by Google
Google Inc., 20 West Kinzie, Chicago IL USA 60610



--
You received this message because you are subscribed to the GoogleGroups "Everything List" group.To unsubscribe from this group and stop receiving emails from it,send an email to [email protected].To post to this group, send email to everything-[email protected].
Visit this group at http://groups.google.com/group/everything-list.
For more options, visit https://groups.google.com/groups/opt_out.


--
You received this message because you are subscribed to the GoogleGroups "Everything List" group.To unsubscribe from this group and stop receiving emails from it,send an email to [email protected].To post to this group, send email to everything-[email protected].
Visit this group at http://groups.google.com/group/everything-list.
For more options, visit https://groups.google.com/groups/opt_out.
http://iridia.ulb.ac.be/~marchal/



--
You received this message because you are subscribed to the GoogleGroups "Everything List" group.To unsubscribe from this group and stop receiving emails from it,send an email to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/everything-list.
For more options, visit https://groups.google.com/groups/opt_out.
[email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/everything-list.
For more options, visit https://groups.google.com/groups/opt_out.
br />


--
You received this message because you are subscribed to the Google Groups 
"Everything List" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/everything-list.
For more options, visit https://groups.google.com/groups/opt_out.

Re: The Singularity Institute Blog

Reply via email to