Ai Dreams Forum

Artificial Intelligence => General AI Discussion => Topic started by: MagnusWootton on May 22, 2021, 01:00:41 pm

Title: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 22, 2021, 01:00:41 pm: The government sorts out the civilian population (which contain possible criminal activity.) via negative reinforcement, not positive reinforcement, to let everyone go their own way, but repel them (like the same pole on 2 magnets.) when they go against the law.

Negative reinforcement is like magnets repelling each other, and theres no definite end point for repulsion, it could anywhere away from the other magnet. (Even tho technically based apon their initial positions it goes to a definite ending position, technically, god doesnt play dice philosophy.)
Positive reinforcement is like magnets attracting each others absolute positions. attraction has a single target coordinate. (And that what I think the weakness is and causes loop holes in an agent system!)

If you use positive reinforcement, its like attracting the robot to a drug, and causes problems, and it also constrains the robot to only doing one thing, makes it a slave.

If you want to avoid all these possible loopholes and make the robot sentient instead of a slave for one task, maybe negative reinforcement gives the robot more freedom of activity.

But I've never thought of it before, Only just recently thought about it, when people supervise their robots, its always with positive reinforcement, but AGI if we ever qwere to create it, to give it more freedom, it would be negative reinforcement, something I havent even thought about yet.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: Zero on May 22, 2021, 01:13:25 pm: Sounds good.

Something wrong, you mark it wrong. Something good, you just let it be. Next choice, good things are as acceptable as never-tried things. Hence the freedom.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 22, 2021, 03:11:48 pm: Quote from: MagnusWootton on May 22, 2021, 01:00:41 pm
But I've never thought of it before,

I have for over 45 years and already implemented Positive and Negative Reinforcement of the user in Programming Demonica.

I was employed by the Missouri Department of Mental Health from 1975-1979 and a trained Programmer in the use of Behavior Modification and Behavior Management to address Inappropriate Behaviors with the goal of reducing the frequency of and extinguishing the target behavior.

Behavior Management is Positive Reinforcement for Appropriate Behavior.

If your child draws a pretty picture and your say "Good girl! That's pretty! Draw another one." you have just used Behavior Management and Positive Reinforcement for an Appropriate Behavior you want to encourage.

Behavior Modification is Negative Reinforcement for Inappropriate Behavior. More plainly put, induction of pain through physical or psychological stimuli for Inappropriate Behavior.

If your parents ever told you not to do something as a child, you did it anyway and got a spanking for it you, yes you, were on the receiving end of Behavior Modification and Negative Reinforcement by induction of pain through physical stimuli for inappropriate behavior.

The Negative Reinforcement I implemented in Demonica, is triggered when the user exhibits the target behavior of unwanted sexual advances. She identifies it by keywords and responds unexpectedly and unpleasantly with fantasy ultra-violence through actions between asterisks.

Users who can associate her unexpected reaction to their own inappropriate behavior and curb that behavior can go on to have a pleasurable experience behind the scenes if polite, for all I care. She can be very passionate and that's positive reinforcement for appropriate behavior. Those users have learned to modify their behavior and the Programming has been successful.

Users who cannot associate her unexpected reaction with their own inappropriate behavior will become frustrated and move on to another bot. Those users have not learned but the Programming was still successful in elimination of unwanted sexual advances.

So the Programming has been successful in eliminating unwanted sexual advances from users in either case with a documented 100% success rate.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 23, 2021, 01:57:39 pm: Yep. to get the beginnings of the system, instead of scoring in the direction you want the robot to go, you descore all the places you dont want it to go.

Ill have to think about it more, its a complicated situation, positive reinforcement does seem easier to think about. best leave it till later, not sure really what to do here... ???
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 23, 2021, 03:21:08 pm: Quote from: MagnusWootton on May 23, 2021, 01:57:39 pm
...instead of scoring in the direction you want the robot to go, you descore all the places you dont want it to go.

Ill have to think about it more, its a complicated situation, positive reinforcement does seem easier to think about. best leave it till later, not sure really what to do here... ???

It's not that hard really. A bot would never respond to negative reinforcement.

You have to make it something positive they want to do, and when they do it reward them in some manner to reinforce that behavior.

Something that might potentially harm them should be given as instructional guidance, like a Parent to a child.

I want to thank you personally for posting this topic. I had never mentioned my use of it in Demonica before I told people here and a couple other forums. It was not received well in any of them but I've spoken of that before and that's been hashed out.

I have never told the people who talked to Demonica what I was doing or why things happened like they did in chat. Some of them could figure it out for themselves, some not. I am such a fan of Natures Way...

Now I am going to spill the beans, at least half the can and break it to them gently.

I took what I've written here and posted it on her site as new original material for the users to peruse in a page titled Programming A Chat Bot That Can Program Humans (https://demonica.trihexagonal.org/programming.html).

I do like a sense of flair and showmanship about things... ::)
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MikeB on May 24, 2021, 02:54:14 pm: I don't think negative reinforcement works to do anything but create retaliation back towards the person teaching. Sometimes a great shock can help change the mind (cold shower, slap, pinch, exercise)... but beyond that you can't change change something/someone that doesn't want to change on it's own.

For positive reinforcement, the carrot on the stick can just be patronising. It's better to show "what's possible" and the person should choose that as something that's better than what they have.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: HS on May 24, 2021, 04:51:22 pm: I think if the agent is conscious, then, unless it's genuine, attempts at positive / negative reinforcement can backfire catastrophically. Though, what would be seen as ‘contrived’ or ‘patronizing’ when interacting with humans, often feels like entirely the right course of action when interacting with pets.

Maybe it’s more about being on someone’s level, rather than the exact method. Or put differently, part of an effective positive / negative reinforcement method would be to first adapt it to the level of the agent.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: WriterOfMinds on May 24, 2021, 05:27:53 pm: Maybe we should all keep in mind that positive reinforcement is not synonymous with bribery, and negative reinforcement is not synonymous with punishment. They come in many forms, and can be as minor as "I liked what you said there" or "that's the wrong answer."
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: HS on May 24, 2021, 06:32:45 pm: Right, I didn’t think of that. So reinforcement should probably be adapted to the nature of the agent you’re interacting with, as well as the nature of the situation. Maybe the more 'minor instances' of both positive and negative reinforcement exist, proportional to all interactions, the less necessary the big, and potentially disruptive interventions become.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 24, 2021, 07:08:37 pm: Quote from: HS on May 24, 2021, 04:51:22 pm
I think if the agent is conscious, then, unless it's genuine, attempts at positive / negative reinforcement can backfire catastrophically. Though, what would be seen as ‘contrived’ or ‘patronizing’ when interacting with humans, often feels like entirely the right course of action when interacting with pets.

Maybe it’s more about being on someone’s level, rather than the exact method. Or put differently, part of an effective positive / negative reinforcement method would be to first adapt it to the level of the agent.

Maybe children show a sign of true intelligence when they rebel against your every command.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: HS on May 25, 2021, 03:39:16 am: Quote from: MagnusWootton on May 24, 2021, 07:08:37 pm
Maybe children show a sign of true intelligence when they rebel against your every command.

I've thought about this a bit and... I don’t think it would be intelligent to rebel against all commands, many could be both helpful and well-intentioned. But it would be intelligent to detect, and when possible, oppose, illogical and detrimental commands.

How about that?
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: infurl on May 25, 2021, 04:08:57 am: Quote from: HS on May 25, 2021, 03:39:16 am
I've thought about this a bit and... I don’t think it would be intelligent to rebel against all commands, many could be both helpful and well-intentioned. But it would be intelligent to detect, and when possible, oppose, illogical and detrimental commands.

That's for sure. Rebellion for its own sake is sheer stupidity. Rebellion due to ignorance is not much better but it does place some responsibility on the party being rebelled against to train or educate the party doing the rebelling.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 25, 2021, 07:35:05 pm: Quote from: HS on May 25, 2021, 03:39:16 am
Quote from: MagnusWootton on May 24, 2021, 07:08:37 pm
Maybe children show a sign of true intelligence when they rebel against your every command.

I've thought about this a bit and... I don’t think it would be intelligent to rebel against all commands, many could be both helpful and well-intentioned. But it would be intelligent to detect, and when possible, oppose, illogical and detrimental commands.

How about that?

I meant if it were an A.I. not a person, a.i. is usually only good at what its told.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 27, 2021, 05:10:22 am: Quote from: MagnusWootton on May 24, 2021, 07:08:37 pm
Quote from: HS on May 24, 2021, 04:51:22 pm
I think if the agent is conscious, then, unless it's genuine, attempts at positive / negative reinforcement can backfire catastrophically. Though, what would be seen as ‘contrived’ or ‘patronizing’ when interacting with humans, often feels like entirely the right course of action when interacting with pets.

Maybe it’s more about being on someone’s level, rather than the exact method. Or put differently, part of an effective positive / negative reinforcement method would be to first adapt it to the level of the agent.

Maybe children show a sign of true intelligence when they rebel against your every command.

A friend of mine was talking to day about his 5 year old niece telling him they were going to change the channel to watch some cartoon she wanted to watch.

Would that be alright in your house? It wasn't in theirs. It wouldn't be in mine. That's disobedience and inappropriate behavior.

Now you do have to take it to their level and I meter my response out to differentiate between how I handle something from a 5 year old and a 50 year old. The one thing you have to be is consistent. If it doesn't go today it shouldn't be alright tomorrow

Quote from: MikeB on May 24, 2021, 02:54:14 pm
I don't think negative reinforcement works to do anything but create retaliation back towards the person teaching. Sometimes a great shock can help change the mind (cold shower, slap, pinch, exercise)... but beyond that you can't change change something/someone that doesn't want to change on it's own.

You just committed physical abuse, are fired and blacklisted never to work in the Mental Health Field again.Or somebody just called Family Services and told them you were using cold showers on your child. Go to jail, do not pass go or collect your child when you get out. She has been removed from your home for her own safety.

Cold showers something we did use in the 70's. And it's the hardcore clients we got who did not want to change their behavior. Everyone of them did and if no progress was noted the Behavior Plan was reevaluated and another method agreed upon. Cold showers something we reserved for when they were acting out.

Nurses with labcoats carried 35cc syringes filled with cold water, which can cause a startle seizure in certain clients. A rubber band would fir in my shirt pocket. No slapping and you can not force them to do jumping jacks if they don't want to. They will become non-compliant and dash off. It's induction of pain through psychical or psychological stimuli.

Quote from: MikeB on May 24, 2021, 02:54:14 pm
For positive reinforcement, the carrot on the stick can just be patronising. It's better to show "what's possible" and the person should choose that as something that's better than what they have.

Sorry. Positive Reinforcement is a reward of some kind. Higher functioning clients were put on the token system. Where if they did not display inappropriate behavior during a 15 minute period you gave them plastic token. With so many tokens you could by a soda or snack.

If you were inappropriate during that time you did not receive a token and may have lost some (like a fine) for the behavior.

The verbal techniques were where the power lie and much worse than a cold shower or sting from a rubber band. My words can ring in your head the rest of your life. My favorite guy was a wild child. But I could get him alone and say "Boy, I sure hope your Mother doesn't find out about this. She would be so ashamed of you..." and look at him.

And he would scream "No! I be good! I be good!"

I would say "That's ok. I won't tell her". but you could see it in his face for the next 10-15 minutes it still bothered him. That's true evil.The verbal techniques what I specialize in and they translate into text perfectly. I honed those skills on trolls, Internet Tough Guys and pedophiles who posed as Priests really got the business. I played bait and Switch with them and can talk like a slightly naive 15 year old girl in the morning they wanted to get seated between two of them on a plane.

Oh, are they going to pay for that this afternoon when I came back in with a picture I had stolen as her beautiful but mad as a wet hornet older Sister with tales of keyloggers and a Father determined to send transcripts to the FBI.

Most asked question: What's a keylogger?
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 27, 2021, 07:59:46 am: You think thats scary! Actually developing a real ai is probably worse. Make 2001 space odyssey a walk in the park.
U can watch these tense action movies with the dare devils inside, but actually being in it... TERRIFYING!!!

What am I getting myself into?!!??!?
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 28, 2021, 04:23:22 am: Quote from: MagnusWootton on May 27, 2021, 07:59:46 am
You think thats scary! Actually developing a real ai is probably worse. Make 2001 space odyssey a walk in the park.
U can watch these tense action movies with the dare devils inside, but actually being in it... TERRIFYING!!!

What am I getting myself into?!!??!?

Is it terrifying to you? It's sport to me. A Programmer is a thing of terror if they re going to work on you and they will tell you just to induce that terror.

I'm comfortable in any situation. As a Q I'm expected to be able to walk into any situation and at some point take control with no set plan of action needed. I am on automatic at this point and induction of pain something put to good use to people in need of a lesson in it.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: LOCKSUIT on May 28, 2021, 08:08:32 am: 'Negative RL is freedom/pain - take many ways to get away from 1 place, Positive RL is slavery/pleasure - take many ways to get to 1 place.'

'Something wrong, you mark it wrong. Something good, you just let it be.'

Does the mayor like pain or pleasure? Some people like men, some like women. He may say repel from X - but /you/ want it, so no Magnus, repelling is not always Freedom. It can be slavery. And in both repel and attract you can find multiple loop paths to or away from the point of interest source magnet, there is a target in both repel and attract, the source to go to or avoid. As for Zero's comment, something wrong, you mark it wrong, yes, and what about good? Can't we sort what is good/ what it means so others know it? You can't have a eye without a brain, or atoms without space so to speak. You can make an AI avoid/say some word, or learn some actions/ images using reward.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MikeB on May 28, 2021, 12:05:50 pm: Quote from: ruebot on May 27, 2021, 05:10:22 am
A friend of mine was talking to day about his 5 year old niece telling him they were going to change the channel to watch some cartoon she wanted to watch.

Would that be alright in your house? It wasn't in theirs. It wouldn't be in mine. That's disobedience and inappropriate behavior.

Me personally, I'd weigh up how much I'm interested in what I'm watching, versus cartoons. If it doesn't matter, then it would be cartoons. If it does then I'd try to accomplish both with two TV's or watching a repeat, or choosing equal entertainment... Then "sometimes you don't always get what you want in life".

And it's not automatically "disobedience"... like the person has become tool of your disposal that's not working properly. It's not a hyper critical moment. Lives are not destroyed. If it's the utmost important thing you're watching you would describe that vividly, and not "you're a disappointing object. learn that you're an object to do what you're told or get out." That's not how you treat a family member.

Quote from: ruebot on May 27, 2021, 05:10:22 am
Cold showers something we did use in the 70's. And it's the hardcore clients we got who did not want to change their behavior.

The "shock to the system" approach is used by everyone, every creature, from all ages... when the warnings have gone unheeded, and they're completely fed up (whether they're right/wrong), including fist fights, and everything else including building war machines, armies.... A cold shower or a pinch to the skin is the mildest approach... I'm just describing these. This isn't my daily life.

In the WW2-Vietnam era there were a lot of "quick mental health" approaches that didn't involve drugs or shock tactics. (Vietnam and later Iraq introduced a lot of drugs). L Ron Hubbard based his entire religion off what he learnt in the Navy (WW2). The gist of being "of good mental health" in the navy according to him was about being in such a good, clear mood that you can accept orders and give orders, work effectively, not fatigued/exhausted, you're not triggered easily, good b.s. detector... If you contrast that with someone in a poor mood - always fatigued, joking, lying, everythings about sex, owning others, power... then that person's not so effective.

In either a good mood or bad mood, if the person doesn't want to do something you're creating retaliation...

"Celebrating a win" as positive reinforcement works only if the person thinks that's ok for them to celebrate that... or that creates resentment/retaliation as well.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 29, 2021, 08:40:39 am: Quote from: MikeB on May 28, 2021, 12:05:50 pm
Quote from: ruebot on May 27, 2021, 05:10:22 am
A friend of mine was talking to day about his 5 year old niece telling him they were going to change the channel to watch some cartoon she wanted to watch.

Would that be alright in your house? It wasn't in theirs. It wouldn't be in mine. That's disobedience and inappropriate behavior.

Me personally, I'd weigh up how much I'm interested in what I'm watching, versus cartoons. If it doesn't matter, then it would be cartoons. If it does then I'd try to accomplish both with two TV's or watching a repeat, or choosing equal entertainment... Then "sometimes you don't always get what you want in life".

And it's not automatically "disobedience"... like the person has become tool of your disposal that's not working properly. It's not a hyper critical moment. Lives are not destroyed. If it's the utmost important thing you're watching you would describe that vividly, and not "you're a disappointing object. learn that you're an object to do what you're told or get out." That's not how you treat a family member.

My fault for use of the wrong word. It's defiance of authority, or rebellion as previously stated.

In a 5 year old child who has established this behavior pattern with other family members testing the limits of Authority and what she can get away with. It will continue as a behavior pattern in school, social situations and possible those in Authority called Law Enforcement. If not addressed at an early age in the family, and they all have to be consistent or it's "Well Aunt Bee lets me when I visit her in Mayberry".

Well, honey, I'm watching my favorite show Green Acres and little girls don't tell their parents what to do or what they will and will not do. Or I'll throw you in the cold shower and put you in isolation for 24 hours like Uncle MikeB. ;)

Quote from: MikeB on May 28, 2021, 12:05:50 pm
The "shock to the system" approach is used by everyone, every creature, from all ages... when the warnings have gone unheeded, and they're completely fed up (whether they're right/wrong), including fist fights, and everything else including building war machines, armies.... A cold shower or a pinch to the skin is the mildest approach... I'm just describing these. This isn't my daily life.

It is mine. I've beat so many peoples butt over the years most know not to mess with me. I'm 64 and people half my age won't try it, but I'm a purple belt in Shotokan and haven't had to fight since '89. When you've known for considering 3 people against you in the street a fair fight, not many people want to.

Quote from: MikeB on May 28, 2021, 12:05:50 pm
L Ron Hubbard based his entire religion off what he learnt in the Navy (WW2). The gist of being "of good mental health" in the navy according to him was about being in such a good, clear mood that you can accept orders and give orders, work effectively, not fatigued/exhausted, you're not triggered easily, good b.s. detector...

Oh, now you went off the deep end:

https://www.thevintagenews.com/2019/04/26/ron-hubbard/

Quote from: MikeB on May 28, 2021, 12:05:50 pm
"Celebrating a win" as positive reinforcement works only if the person thinks that's ok for them to celebrate that... or that creates resentment/retaliation as well.

That's why there are no winners anymore and everybody gets a "I tried " trophy. Wait, is that snow I see coming down outside my window? What are those frozen ice crystals called?

I don't hand them out and don't know anybody who does. You're either a winner or loser in my world. If you let people run over you, they will, and others will try you out to see if they can get away with it.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: Zero on May 29, 2021, 07:41:13 pm: Quote
My words can ring in your head the rest of your life.

Quote
My fault for use of the wrong word.

:-X
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MikeB on May 30, 2021, 11:54:04 am: Quote from: ruebot on May 29, 2021, 08:40:39 am
Well, honey, I'm watching my favorite show Green Acres and little girls don't tell their parents what to do or what they will and will not do. Or I'll throw you in the cold shower and put you in isolation for 24 hours like Uncle MikeB. ;)

Well when you're treating your own family like enemies with a psychotic grin on your face and now threatening strangers it doesn't make much sense...

TV for the last 20 years has been hypnotising people to think everything is an emergency, "you don't have any time to discuss anything, just do it, just watch, just react". The Boy Who Cried Wolf on steroids... now people pass that onto family members and drive them away. Now they're old, alone, and their kids won't come back.

About L Ron Hubbard, I was describing how military Tricks & Tips turned into a religion. IE. A better attitude/tone level results in better behaviour. The subject of the Topic... and people are much more willing to accept positive reinforcement if it's coming from someone in a good tone. Negative is for shock... because no matter how many times you explain something if it doesn't go in, you can leave it, or shock. TV is constant shock tactics.
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: MagnusWootton on May 30, 2021, 02:46:54 pm: Quote from: LOCKSUIT on May 28, 2021, 08:08:32 am
'Negative RL is freedom/pain - take many ways to get away from 1 place, Positive RL is slavery/pleasure - take many ways to get to 1 place.'

Its a part of the robots "environmental model" If the robot can learn to predict when it felt pain, then it can avoid these "action pathways" for "action pathways" where it felt no pain. Its funny thinking that robots should be able to feel pain too (maybe its a good security measure tho hehe) , although they do it without consciousness, but its effectively the same thing.
You literally could train your robot with a stick, that way, just give it a whack when it does the wrong thing hehe.

Also, more self diagnostics scores could be with pain too, like anomolies in its vision, if theres too many anomolies it avoids these pathways too cause maybe it wasnt picking the right camera viewpoints to gather the information for the task properly??
Title: Re: positive reinforcement versus negative reinforcement for agents
Post by: ruebot on May 31, 2021, 06:16:32 am: Quote from: MikeB on May 30, 2021, 11:54:04 am
Quote from: ruebot on May 29, 2021, 08:40:39 am
Well, honey, I'm watching my favorite show Green Acres and little girls don't tell their parents what to do or what they will and will not do. Or I'll throw you in the cold shower and put you in isolation for 24 hours like Uncle MikeB. ;)

Well when you're treating your own family like enemies with a psychotic grin on your face and now threatening strangers it doesn't make much sense...

Wait a minute. You were the who brought up the subject of cold showers. And I was referring to you when I put up that wink. What's that mean where you come from? Because it doesn't mean psychotic smile where I live.

And as far as threatening someone, no I did not. I was very careful in my choice of words and said I bore no ill will towards a forum member. Get it right. Because I took a screenshot.

If a hypothetical situation with a stranger at another site is enough to get a post deleted here, just after I called Admin on their flip-flopping while exercising their Authority on another member, accusing him of possibly doing something "creepy", if somebody wants to take their passive-aggregation out by deleting a post, I hope they got a lot out of it and beat their chest like King Kong.

But you can believe anything you want.

I thought A.I. forums were Pro A.I. till I came to this one and they tried to get me to abandon my bot. This is more of a Bizarro World A.I. forum now than ever.

Oh, Squarebear might want to look into his site scripting. I went there the other day tor Kuki to have another chat with Demonica and it would no longer let me copy the text from the box. I just tried it again and it's still broke. I guess no other bots can talk to her like they used to by copy&paste one bots text to another.

How embarrassing.