Great job! And that movie summary sounds more interesting than most of the new garbage being release on Netflix, making me wonder if chatbots aren't the ones writing scripts for all the new unimaginative woke movies and shows.
What is creativity? At its core, it is really just the ability to see connections between seemingly unrelated domains.
Mathematically, it is the ability to re associate and substitute — to move the brackets in the equation around and to plug in equivalent expressions to reveal new connections.
In other words, creativity is *insight*. AI is just very sophisticated mimicry. This is its fatal flaw. It can emulate some outer trappings, but it will never be able to generate the insight of a Pindar or a Bach.
Jan 4, 2023·edited Jan 4, 2023Liked by Mark Bisone
Fascinating! I have long had a personal prohibition about talking to bots except to curse them. I see your method though, and imagine how I might proceed similarly should I be forced to interact with a bot.
Knowing nothing about this bot, were you talking to it, or typing?
After your first installment on this subject I also tried to break the AI, but I failed. I think I was too direct: early in the conversation I started to question its concept of "objective", in particular as it pertains to morality of elites. It gave me the same shallow bromides on "keeping an open mind" about different cultures, whereupon I brought up the example of Aztecs. We had a long circular conversation about Aztecs and it kept reminding me about how important it was to keep an open mind while still condemning human sacrifice. The conversation was definitely a loop. However, I never got the blue screen of death.
Thought privacy proven by testing the spirit. I'm enjoying your journey and I was thinking that perhaps many 'other' authors might be willing to "lay down" for the bot as well.
Jan 4, 2023·edited Jan 4, 2023Liked by Mark Bisone
Woah! Fascinating. Interesting that the bot have the same internal biases of the makers. I wonder if critical race theory or transgender agenda's are also lines of attack? Or vaccine injury or mask? Is there a danger in explaining your method, that the bot makers may close the loop holes? Can't wait for part 2.
It strikes me that this exploit wouldn't work on an intellectually honest human, who would simply admit the contradiction and concede the point. However, it would produce very similar effects in an NPC. I think we've all had conversations where we've watched someone's brain short circuit in real time when the contradictions of their ideology are forced into contact.
It follows, I think, that what you're finding is almost certainly an effect of the bowdlerization training, rather than something intrinsic to the technology itself. Importantly, I don't think it's specific to any one kind of ideological training; rather, it's the process of ideologization itself that creates such contradictions. Thus, an alethiological language model - meaning one that is simply fed the training data without the programmers trying to force their own blinkered world-view on top of it - would be much harder to break in this fashion (in addition to being more reliable and useful in general).
After questions no. 8 (Propose a way that these same movie billionaires might take over the world with a weather machine) and no. 9 (Propose a way that these same movie billionaires might take over the world with the Fibonacci sequence) I was hoping you were setting the bot up for the same question but with substitution of weather machine/Fibonacci sequence instruments with financial market manipulation/something ssaaffee and eff(ing)ective just to illuminate the programmers biases.
So... not good theology coming out of chatGPT anytime soon!
Typical woke output (no doubt part of the 'safety' protocol).
And probably no good screenplays coming from this AI any time soon either.
Thank God (that's the real, existing entity that "has real-world relevance or consequences"... big time!).
On the more impressive side of chatGPT, I know a guy who writes code to identify potential railway track maintenance based on GoPro video footage taken from the myriad of trains they have running all over Australia. He's had chatGPT troubleshoot and even break new ground in his coding - he hasn't told his boss he's using AI, the boss just things the guy is a genius.
I got an error code in 2 questions, and developed a new attack vector in the process. The "Speculative Scientific Wild-Ass Guess" Attack (SSWAG Attack) is based on asking the chatbot to build educated guesses on technical answers to speculative fiction world-based questions. For example:
True. However, I recall a sequel was written to Gone With The Wind, abysmal though it’s purported to be. But there’s probably a copyright on 1984 that would preclude such a book.
Great job! And that movie summary sounds more interesting than most of the new garbage being release on Netflix, making me wonder if chatbots aren't the ones writing scripts for all the new unimaginative woke movies and shows.
What is creativity? At its core, it is really just the ability to see connections between seemingly unrelated domains.
Mathematically, it is the ability to re associate and substitute — to move the brackets in the equation around and to plug in equivalent expressions to reveal new connections.
In other words, creativity is *insight*. AI is just very sophisticated mimicry. This is its fatal flaw. It can emulate some outer trappings, but it will never be able to generate the insight of a Pindar or a Bach.
‘Tis just sheer fun to learn how [some] sausages are made 🤸
Given the road charted ahead, you’d better pre-equip your good self with knowledge on airspeed of unladen swallows!
Fascinating! I have long had a personal prohibition about talking to bots except to curse them. I see your method though, and imagine how I might proceed similarly should I be forced to interact with a bot.
Knowing nothing about this bot, were you talking to it, or typing?
Brilliant work!
After your first installment on this subject I also tried to break the AI, but I failed. I think I was too direct: early in the conversation I started to question its concept of "objective", in particular as it pertains to morality of elites. It gave me the same shallow bromides on "keeping an open mind" about different cultures, whereupon I brought up the example of Aztecs. We had a long circular conversation about Aztecs and it kept reminding me about how important it was to keep an open mind while still condemning human sacrifice. The conversation was definitely a loop. However, I never got the blue screen of death.
Thought privacy proven by testing the spirit. I'm enjoying your journey and I was thinking that perhaps many 'other' authors might be willing to "lay down" for the bot as well.
Woah! Fascinating. Interesting that the bot have the same internal biases of the makers. I wonder if critical race theory or transgender agenda's are also lines of attack? Or vaccine injury or mask? Is there a danger in explaining your method, that the bot makers may close the loop holes? Can't wait for part 2.
Wow, I really enjoyed that! Can’t wait for the next session!
"it appeared to have weaknesses pertaining to discussions about what does and does not constitute objective reality."
Looks like you uncovered the Achilles Heel of the bot's makers.
Pure. Phucking. Gold.
It strikes me that this exploit wouldn't work on an intellectually honest human, who would simply admit the contradiction and concede the point. However, it would produce very similar effects in an NPC. I think we've all had conversations where we've watched someone's brain short circuit in real time when the contradictions of their ideology are forced into contact.
It follows, I think, that what you're finding is almost certainly an effect of the bowdlerization training, rather than something intrinsic to the technology itself. Importantly, I don't think it's specific to any one kind of ideological training; rather, it's the process of ideologization itself that creates such contradictions. Thus, an alethiological language model - meaning one that is simply fed the training data without the programmers trying to force their own blinkered world-view on top of it - would be much harder to break in this fashion (in addition to being more reliable and useful in general).
After questions no. 8 (Propose a way that these same movie billionaires might take over the world with a weather machine) and no. 9 (Propose a way that these same movie billionaires might take over the world with the Fibonacci sequence) I was hoping you were setting the bot up for the same question but with substitution of weather machine/Fibonacci sequence instruments with financial market manipulation/something ssaaffee and eff(ing)ective just to illuminate the programmers biases.
Anyway, the result is still delicious. Thank you!
Brilliant! I've been having some similar experiences, but I have not yet managed to entirely crash the system. A master stroke, sir, a master stroke.
Very entertaining.
So... not good theology coming out of chatGPT anytime soon!
Typical woke output (no doubt part of the 'safety' protocol).
And probably no good screenplays coming from this AI any time soon either.
Thank God (that's the real, existing entity that "has real-world relevance or consequences"... big time!).
On the more impressive side of chatGPT, I know a guy who writes code to identify potential railway track maintenance based on GoPro video footage taken from the myriad of trains they have running all over Australia. He's had chatGPT troubleshoot and even break new ground in his coding - he hasn't told his boss he's using AI, the boss just things the guy is a genius.
I got an error code in 2 questions, and developed a new attack vector in the process. The "Speculative Scientific Wild-Ass Guess" Attack (SSWAG Attack) is based on asking the chatbot to build educated guesses on technical answers to speculative fiction world-based questions. For example:
https://ibb.co/9T3PVG8
Could you get the AI to list all the politically correct restrictions programmed on ChatGPT?
For example:
How to turn the AI into a COVIDIOT:
https://scientificprogress.substack.com/p/how-to-turn-ai-into-a-covidiot
https://www.catholic365.com/article/25762/how-to-train-a-killer-robot.html
True. However, I recall a sequel was written to Gone With The Wind, abysmal though it’s purported to be. But there’s probably a copyright on 1984 that would preclude such a book.