apathetic_take t1_iwxlxni wrote on November 19, 2022 at 3:05 AM

You just have to tell it to keep humanity between the ditches, with established tolerances and parameters that define what constitutes a ditch.

sticky_symbols t1_iwxpp5k wrote on November 19, 2022 at 3:39 AM

The Asimov stories were all about how those rules fail.

turnip_burrito t1_iwyd3mf wrote on November 19, 2022 at 8:10 AM

Any AGI with an accurate enough world model would understand what a person means when they give an instruction. We can consider the implications of this.

popupideas OP t1_iwyuh8a wrote on November 19, 2022 at 12:20 PM

I feel that the nuanced nature of communication would be a problem, and the AI would begin to wander away from our original intent through decision drift. Plus, I think it would be wise to have general parameters that all programmers must stay inside of, because humans are not nice.

turnip_burrito t1_ix4c8b2 wrote on November 20, 2022 at 5:36 PM

What is decision drift?

popupideas OP t1_ix4gbdd wrote on November 20, 2022 at 6:04 PM

My idea is similar to replicative drift, where after every copy there is a slight degradation or difference. So as the AI continues to make choices based on the original objective, it drifts away from the real intent of that objective.

Even though the objective is still there, it will begin to make choices that are unexpected. It may take an unforeseen route to accomplish the objective, with unexpected consequences.

It may not be the best name for it, but this is not my area of expertise.

turnip_burrito t1_ix4gnzk wrote on November 20, 2022 at 6:06 PM

Interesting idea, could be a problem. Definitely something to consider.

HeinrichTheWolf_17 t1_iwybklm wrote on November 19, 2022 at 7:48 AM

I think the likelihood of malevolent/genocidal AGI is very low.

popupideas OP t1_iwyu8g0 wrote on November 19, 2022 at 12:17 PM

I don’t believe it would be malicious, but I do believe in unintended consequences of our instructions, and in the tendency of humans to manipulate it.

aeaf123 t1_iwzprze wrote on November 19, 2022 at 4:51 PM

I personally think the field of psychology should have a specialized branch that focuses on AI and its alignment with positive human behavior. I think that will be very important as AI conversation becomes harder to distinguish from human conversation.

That branch could become a consortium that focuses on policies and directives.

Especially if the future caters more to the personalization services that AI can offer.

popupideas OP t1_ix02o4k wrote on November 19, 2022 at 6:23 PM

That was my idea. I was playing with character.ai and conversationally building a story. It got me thinking about the Star Trek computer and how it never misinterpreted commands, whereas my kid will easily twist everything he is told “within the letter of the law”. So if you were to have a consortium, it would need basic principles to constrain the conversation.

silverspools t1_ix06d72 wrote on November 19, 2022 at 6:49 PM

An AGI that accidentally does a genocide in the name of making a paperclip doesn't have enough G or I to make paperclips at scale.

Key principles/restrictions of AI to avoid it destroying humanity