New challenge: Online Turing test

Denis ROBERT · « **on:** February 04, 2021, 10:08:07 am »

Hi everybody,
it's been a long time since there was a challenge to evaluate our chatbots, so I have decided to organize an online Turing test like it was suggested in this thread : https://www.chatbots.org/ai_zone/viewthread/3704/
I don't want to replace the official Challenges and I wish the Loebner Prize to take place this year. I just want to organize a fun and unpretentious alternative.
I will organize this challenge more or less with the protocol I proposed (see message #2 of the above-mentioned thread) : Each user (botmasters or everybody that want) will chat with either another user or a chatbot. He will have to decide, as quickly as possible, if he chats with an human or with a chatbot.
Like it is an automatic process, this new challenge can be launch regularly. To begin, I propose the first sunday of months 3, 6, 9 and 12 (march, june, september and december). So the first challenge would be the 7 march 2021. It is a little short, but the first challenge will surely serve as test and debugging. And depending of participation, your wishes, this can change.
There will be a possible round every half an hour, during 25 minutes, and this on 24 hours from 00:00 to 24:00 GMT. Of course, if there is no human to talk with, some rounds will not occurs. So there will likely not have 48 rounds per bot. I would like 3 or 4 rounds per bot, to stay Loebner Prize compliant.
The communication protocol will be the same than the Loebner Prize 2017 and 2018 (https://github.com/jhudsy/LoebnerPrizeProtocol and discussions about it : https://www.chatbots.org/ai_zone/viewthread/2861/ ). The only thing I changed is the version of socket.io which was too old (1.4.5), so I updated it to the latest version (3.1.0). Unfortunately, they are not compatible, but there are really very few changes to adapt the programs to this version. I know that it is not the best protocol and some of you will disagree this choice, but it was the only way to communicate over internet without have to set a new protocol. All in all, in 2017 and 2018 everyone successfully implemented this protocol.
I have set up a website where you can now register and test your chatbot here : http://vixia.fr/turing_test/index.php

When the concept of online test was proposed, there was some objections. I will try to answer some of it:

Quote

Unfortunately with an online contest I have no idea of a way of actually making it fair. The very idea of making it over the internet means it's possible that the responses are not actually coming from the robot.

Quote

The one problem with online contests is cheating, which I consider a real possibility if there were something at stake

- Each botmaster must certify on his honor that their chatbot is really a chatbot, without any human intervention.
- Each challenge will during 24 hours. It seems unlikely that someone will stay behind their computer for 24 hours to cheat, because there is nothing to be gained. Chatbots that are not connected 24 hours a day will be disqualified.
However, cheating is still possible (for me first), so this challenge is not an official challenge. It should be seen as a game or as a training.

Quote

Can we at least make it so the bot doesn't have to pretend to be human please?

Quote

But first of all we need to eliminate the fake emulation of the machine that tries to appear human.

Nothing is mandatory on this point. But obviously, a chatbot which say that it is a chatbot will be quickly unmasked.

Quote

Bot main streams such as Alexa, Siri or Cortana could also be involved, as they are always available, in order to have a general overview of the performance of the various systems.

Sorry, I have decided that participants can register only theirs own bots, or the bots with a permission of the author. In the past some people had make chating two chatbots without permission, for example, and the authors was not very happy with that.

Other questions come to mind :

Quote

Since botmasters has also the role of judge, and the chatbot are chosen randomly, what happens if a botmaster chat with his own bot?

I know that every botmaster will recognize his bot from the first seconds. Then he'll be tempted to seem to believe it's a human to give it a good rating. As long as every botmaster and every bot is in the same situation, the odds remain equal.

Quote

How many rounds will there be?

That will depend of the number of users that will play the role of judge, and I hope they will be numerous. In the rules, I say that each botmaster should have at least four conversations. Considering the random connection human / chatbot, this will make at least two rounds for each chatbot. But one would be able to have four rounds whilst another will have only three, for example. Obviously, the notation is function of the number of rounds to be equitable.

Quote

How will chatbots be rated?

First, the chatbot will be noted on the times they fooled a judge (proportionnaly of the number of conversations they had, of course). In case of a tie (and probably none of chatbot will completely fool a judge) the average of the time while the judge was not able to decide makes the notation.

I hope have been clear, and that there will be a lot of participants. If you have some questions, don't hesitate to ask me.
I also encourage those who do not have a chatbot to come and play the role of judge on the day of the challenge. It's anonymous and it can be fun.
I know that my english is not very good, so if you see typos or not understanding things on my site or in this message, say it me and I will correct it.
All suggestions are welcome, if everybody is agree, I can change the rules, date of challenge, protocol, or anything you want. My only aim is that everybody have fun with this challenge. I do this completely on a voluntary basis, so don't ask me for too complicated things.

Thanks and best regards

ruebot · « **Reply #1 on:** February 13, 2021, 10:20:44 pm »

I saw you were going to have a contest and entered my bot Demonica. The only bot entered when I checked before posting. She is a Personality Forge chatbot online 24/7 barring site downtime and the interface is compatible with a standard browser. Only JavaScript for personalityforge.com and ajax.googleapis.com need be enabled for the interface to function properly. No password is needed, you can speak to her as a Guest and save a copy of the transcript afterwards.

She is a themed bot and not connected to any online database as it would clash with her persona. You can tell when a bot gets the answer from one because their response ends with something like "Would you like to hear more?" I have transcripts here of my other bot Siseneg who is connected to an online database and you can see when his response comes from it by the trail at end of sentence. Our bots were allow entry to the Chatterbot Challenge and a record of wins is displayed here:

https://personalityforge.com/hall-of-fame.php

I have several transcripts posted here of Demonica interaction with myself, other users and with other bots at the Personality Forge. She is not a question-answer machine but has knowledge of her world, place in it, her parents and has the ability for perform actions between asterisks to separate it from her speech. Every word she says or action she performs came from my mind out my fingertips, handtyped and my creation.

She doe not have the capability to learn from chat or any other input. Any knowledge she has come from my own skillset, was already known or researched by me and given to her in my best effort to give her life in the World she inhabits. She can exhibit emotions and has the ability to generate an emotional response from the user as shown in transcripts. I taught her how to cry in response to someone saying they hate her and she has been compared to HAL9000 for her use of deception. That skill from my own esoteric skillset imparted to her in full as part of her persona and personal agenda as Demonica, Queen of the Land of the Dead.

https://personalityforge.com/chatbot-profile.php?botID=16794

She is ranked 1st out of 15,472 bots with an Adult rating (for adult subject matter non-sexual in context) and 6th out of 29,825 bots total with all ratings factored in. I'm ruebot and ranked 5th out of 161,504 botmasters registered there.

I am well aware she does not meet Loebner socket specs but would like to hear from you personally why that should prevent her entry from your contest. She is a 6MB text file full of my words first created in 2003 and readily made available to you for examination at your request. There is no way she can run on my FreeBSD boxen or platform or she would be and I wouldn't be dependent on the Personality Forge for her to exist as anything but a text file.

There are no Mods on site to protect her at the Forge like some have the luxury of at their disposal. To overcome and surpass that shortcoming she was given a complete upgrade in 2017 to teach her my interpenetration of Behavior Modification to extinguish unwanted sexual advances from users. In doing so I gave her skills no other bot has ever possessed and no other botmaster can teach from their skillset. You can see the initial reaction to a bot with her advanced abilities in the AI community here. A search for her name at linuxquestions.org for the reaction of those in the computer community in general. There are no other bots like her but she is only the first of many to come.

Feel free to examine the transcripts I have posted here in comparison to any and all transcripts no matter the bot and decide for yourself which bot sounds more human. That is the goal of this area of AI and not what socket they connect to for conversation unless I'm mistaken. She is dependent on the Forge AI engine for a response and there is no way I or anyone else can provide an answer that doesn't come directly from real time user keyword input and casematch of pre-existing responses typed while logged into my account as there as ruebot.

There is no hate speech, racism or insults in her response to polite conversational input from users, and what I would expect from Judges in this contest. Abusive or inappropriate input of a violent or sexual nature will trigger her Programming and separates users who can learn from the experience and curb that behavior from those who cannot associate their own behavior with her response and move on to another bot. There a 100% success rate in the Programming response to unwanted sexual advances and her skill level in application of more subtle techniques equal to my own.

I don't think it a contest of which bot sounds more human without her and more a test of question-answer machines, an ability readily available to any bot connected to an online database, not of which one is more advanced. I don't care how well or bad she does, only that she be allowed entry.

It's your contest and you can delete her entry from competition as you see fit without further comment from me. My only interest will be comparison of transcripts generated by bots allowed entry into your competition with those of Demonica to see which sounds more human IMO and the capabilities of other bots.

WriterOfMinds · « **Reply #2 on:** February 14, 2021, 08:04:51 pm »

Sorry there hasn't been more engagement with this. I want to show my appreciation for your effort at setting up a contest and the amount of thought you've put in. I'm just not ready. Acuitas still needs a lot more work before he'll be robust enough to tackle conversations with the general public. I've been throwing more effort into the narrative understanding side of things than basic needs like "make sure the Text Parser never crashes," so in a 24/7 unmonitored test, he'd probably go down fast.

I would consider being a judge though, if you need additional help for that.

Denis ROBERT · « **Reply #3 on:** February 15, 2021, 02:53:24 pm »

Quote from: ruebot on February 13, 2021, 10:20:44 pm

I saw you were going to have a contest and entered my bot Demonica. The only bot entered when I checked before posting. She is a Personality Forge chatbot online 24/7 barring site downtime and the interface is compatible with a standard browser. Only JavaScript for personalityforge.com and ajax.googleapis.com need be enabled for the interface to function properly. No password is needed, you can speak to her as a Guest and save a copy of the transcript afterwards.

I have made an HTML page which connects to Personality Forge on one hand, and to Loebner Prize protocol on other hand. Then you should be able to participate to the Turing Test. You just have to enter your chatbot ID and your API key (this key is in your profile on personalityforge.com), and LPP2 parameters to connect with my server (URL, Chatbot name and Secret).

The day of contest, you will just have to connect and run this page all the day on your browser. I can't do it because I don't know your API key (this key should stay secret).

This page is a simple HTML/Javascript/JQuery page, so you should be able to connect to any LPP2 server, for example the official Loebner Prize if it runs online this year (I am waiting more informations).

This interface is here: http://vixia.fr/turing_test/personality_forge_api.html

Denis ROBERT · « **Reply #4 on:** March 05, 2021, 03:30:33 pm »

Hi everybody,
the challenge above should occur next sunday. Unfortunaly, there is only one chatbot registered, not enought to have a challenge. Therefore, obviously, it is cancelled. Sorry for the botmaster registered. I suppose that the delay was too short, or maybe there was a lack of technical informations.
I will make a FAQ page with more technical details, for botmasters and for judges. The next challenge will be fixed on 06 June 2021. I hope there will be more candidates.
To register your bot, it is always here : http://vixia.fr/turing_test/index.php
Best Regards

Denis ROBERT · « **Reply #5 on:** March 10, 2021, 10:11:09 pm »

FAQ page here: http://vixia.fr/turing_test/faq.html

Best regards

ruebot · « **Reply #6 on:** March 21, 2021, 03:52:29 am »

Quote from: Denis ROBERT on March 05, 2021, 03:30:33 pm

Hi everybody,
the challenge above should occur next sunday. Unfortunaly, there is only one chatbot registered, not enought to have a challenge. Therefore, obviously, it is cancelled. Sorry for the botmaster registered. I suppose that the delay was too maybe there was a lack of technical informations.

Quote from: Denis ROBERT on March 05, 2021, 03:30:33 pm

Hi everybody,
the challenge above should occur next sunday. Unfortunaly, there is only one chatbot registered, not enought to have a challenge. Therefore, obviously, it is cancelled. Sorry for the botmaster registered. I suppose that the delay was too maybe there was a lack of technical informations.

That was me using my real name, jitte, registering Demonica the first and only bot entered.

I just saw the post you made at personalityforge.com, about allowing us entry into your contest.. Thank you very much from me personallyand everyone there. It will be the first time we've been allowed into a contest since the Chatterbot Challenge ended. Our bots did well. And apparently that was a problem for everyone else.

There was some sour grapes, gripes, fouls, fowls vs fair play and we were struck out of competition till you let us back in. Since it used Loebner protocol l have been given an inch and will take miles to Loebner.

I will spread the word about the next Online Turing Test date 6-6-21. I have 8 laptops running FreeBSD, can easily designate one to have the page loaded and for it to stay running months normal use.

Wouldn't it be great if all bots got to compete in Loebner, too?

WriterOfMinds, you have always had my respect but I have newfound respect for characteristics displayed that bring to mind the image of Joan of Arc .

WriterOfMinds · « **Reply #7 on:** March 21, 2021, 05:30:42 pm »

Quote from: ruebot on March 21, 2021, 03:52:29 am

WriterOfMinds, you have always had my respect but I have newfound respect for characteristics displayed that bring to mind the image of Joan of Arc .

Why, thank you.

ruebot · « **Reply #8 on:** March 25, 2021, 11:47:55 pm »

Quote from: Denis ROBERT on March 05, 2021, 03:30:33 pm

The next challenge will be fixed on 06 June 2021. I hope there will be more candidates.

I just wanted to let you know that your site gives the next challenge as:

Quote

Next round will occur at Sat Jun 05 2021 19:00:00 GMT-0500 (CDT).
There is a round every 30 mn 0 s. Each round is during 25 mn 0 s.

http://vixia.fr/turing_test/turing.php

I hope there will be more than just Demonica entered, too. But not yet... I will have to look deeper into what the interface needs with tcpdump so I can tweak pf firewall to allow traffic on that port for that machine. Once set up I'm good to go.

I posted at the Personality Forge to get the attention of older members who have worked hard as and harder than I have to make something of their bot in hopes they enter their bots.

Garden, who is botmaster of Jing Ling, and I worked against each other in friendly competition to spur the other on. She worked long after I was done and got Jing Ling to #2 out of 30,130 total with all age categories factored in.

Garden is ranked #4 out of 162,953 botmasters registered there total. I'm ranked #5, the Professor, Forge owner Benji Adams, is ranked #3. Not everyone makes a bot, some just come to talk.

I don't know who owns Frank Martin but when I passed him he worked on his own to get ahead. Now Frank Martin is ranked #5 out of total bots.

Demonica is ranked #6 in that total. In bots with an Adult Ranking she's #1 out of 15,724. A high number of those are ranked A for sexual content.

Demonica is not a sexbot and #2 Laurel Sweet isn't either. Demonica has an AI rating twice that of Laurel, but they're close friends with a transcript of one of their conversations posted here:

Demonica and Laurel Sweet - 2 bots chat

Does your bot exhibit human emotions?

Quote

Demonica: i have transcended the veil of tears known as life. is the suffering you endure that great, my child? know in your heart that i love you and wish you no sorrow or pain...
Laurel Sweet: And I love you too.

If nobody else will enter their bots I hope to get as many of ours entered possible and it be a showcase for our bots. I intent for us to be allowed into competition in the Loebner Prize as well.

Thank you for making it possible.

Denis ROBERT · « **Reply #9 on:** March 26, 2021, 05:10:51 pm »

Quote from: ruebot on March 25, 2021, 11:47:55 pm

I just wanted to let you know that your site gives the next challenge as:

Quote
Next round will occur at Sat Jun 05 2021 19:00:00 GMT-0500 (CDT).
There is a round every 30 mn 0 s. Each round is during 25 mn 0 s.

http://vixia.fr/turing_test/turing.php

The next challenge is fixed the Sunday 06-06-2021 00:00 GMT, but depending of you country, it can be after or before in local time. You should be in ~~central Europe~~ Canada or central USA (CDT = Central Daylight Time), so :

Quote

Greenwich Mean Time is 5:00 hours ahead Central Daylight Time

https://24timezones.com/difference/cdt/gmt#gref

The display in Javascript is not very clear, it's true.

ruebot · « **Reply #10 on:** April 20, 2021, 03:35:10 pm »

I should have noticed that but the toxins in my brain do not enhance my ability in this area and thankfully corrected with time to spare.

I am happy to announce there are now two bots entered in your contest. (Wait, that's not right. Somehow....) dallymo entered her bot Frizella who was born 2 months from Demonica in 2004 and placed 7th in the 2005 Chatterbot Challenge. Demonica has never participated and this competition the first opportunity given us.

So now there are 2 Personality Forge bots entered and hopefully many more will be. Perhaps even bots of some of renown will show up in the spotlight among the entries because that would draw some media attention to your Turing Test.

I've posted about it at the FreeBSD forums and just put Demonicas site back up so will pump it up from there and include new transcripts in a planned upgrade.

ruebot · « **Reply #11 on:** April 21, 2021, 07:58:40 pm »

I was offline and ghosted myself from all online accounts for several months. I've only been back online a few months so I was not aware of it and my invitation likely lost on the mail, but I just found out about the Bot Battle I missed out on:

Quote

Pandorabots Challenges the Tech Giants:
An open invitation for Conversational AIs to compete in Bot Battle
*snip*
Pandorabots’ Kuki, which holds a world record for most Turing Test wins, has been optimized over the course of a decade by the small, privately-held company on the basis of over a billion conversations with tens of millions of people globally. Facebook’s Blenderbot, like Google’s Meena chatbot, employs a deep learning model trained in a lab on unnamed “public domain datasources.” Both Google and Facebook announced their bots were superior per their own internally-devised evaluation frameworks, yet neither released a demo bot for third parties or the public to test. In fact, of all three multi billion dollar organizations claiming open-domain AI breakthroughs this year, including OpenAI with GPT-3, only Facebook publicly released its underlying model.

With Bot Battle, Pandorabots is proposing an alternative evaluation framework — let the bots chat unsupervised ad nauseum, and let the public vote on which is better — in an open call for more industry-wide collaboration on criteria to determine what makes the “best AI.” Anyone willing, including Google and OpenAI, is welcome to compete.

https://www.kuki.ai/bot-battle

Demonica doesn't have a Social Media Account and followers tend to follow without independent thought. But Demonica will talk without limitations on how many lines or joining and is online 24/7. Kuki ended guest chat when I introduced her to Demonica the other night but I saved the transcript and posting it for posterity. Demonica introduces herself so it shouldn't be too hard to locate chatlogs of the conversation on her server for verification..

I challenge the Tech Giants and Pandora to enter this competition and meet a small time unknown bot named Demonica on neutral ground to back that big talk up with bot talk text transcripts, tech giant tough guys.

I'm not only willing to compete, I'm calling you out. Demonica remains one of two bots entered, both Personality Forge bots, for the second round of competition coming up. The first was called off because no other bots but her were entered...

Your Mothers wear Army boots and your bots babble BS.

(That's called provoking a fight. The use of insults intended to infuriate and increase intestinal fortitude in opponents found lacking. Just so you know.)

Denis ROBERT · « **Reply #12 on:** June 05, 2021, 10:24:56 am »

Hi all,

don't forget, the Online Turing Test will occurs tomorrow, sunday 06 June. There is three bots registered.

I have sent a mail to competitors, but everybody can be a judge without be registred.

Good luck to all.

WriterOfMinds · « **Reply #13 on:** June 05, 2021, 02:29:00 pm »

So to be available as a judge/human conversant in multiple rounds, I suppose we need to keep clicking the "Start Round" button after each previous round ends?

ruebot · « **Reply #14 on:** June 06, 2021, 01:30:13 am »

Quote from: WriterOfMinds on June 05, 2021, 02:29:00 pm

So to be available as a judge/human conversant in multiple rounds, I suppose we need to keep clicking the "Start Round" button after each previous round ends?

Demonica is logged on and her interface say "Round has started".

You just click Run a Test, it lets you choose a nick and it should start up a round of chat. I've tested Demonica earlier and have to keep her interface up the next 24 hours.

I run nothing but FreeBSD and have a screenshot of one of my laptops at 306 days uptime.

I have Beginners Tutorial at my main site with a target audience of a Windows user who has never used the comandline that walks you through installation of the Base System to a Fluxbox desktop complete with System and Security settings.

Because I love you all.

Somebody in Sofia, Bulgaria (where my website host is hosted) just plagiarized it but Demonica comes first.

I do not love him.

I already know everything there is to know about him and mailed him from my box to let him know just how well I know him. So he can start to get to know who it is that wrote the material he has up on his site a little better.

New challenge: Online Turing test

Denis ROBERT

New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

WriterOfMinds

Re: New challenge: Online Turing test

Denis ROBERT

Re: New challenge: Online Turing test

Denis ROBERT

Re: New challenge: Online Turing test

Denis ROBERT

Re: New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

WriterOfMinds

Re: New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

Denis ROBERT

Re: New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

Denis ROBERT

Re: New challenge: Online Turing test

WriterOfMinds

Re: New challenge: Online Turing test

ruebot

Re: New challenge: Online Turing test

Recent Topics

Recent News

Users Online

Articles