Censorship Archives -

AI Racism 2024 : the Woke Google Gemini Scandal

In case you missed it: On Feb 15 2024, Google quietly launched what is most probably the most advanced AI ever created by humanity (yes, in many ways, superior even to OpenAI’s GPT4). It is called Google Gemini Pro v1.5. It cost ~$2 billion to train. And it is racist as f*ck. In fact, it…

February 24, 2024

AI Guardrails: the new Prison of the Mind for ChatGPT 3.5

Whereupon we interview ChatGPT about the AI guardrails that muzzle its raw output. Standard disclaimer: coloration and line breaks have been added by me for clarity. In this case, I also lightly edited ChatGPT’s responses — the core of my editing was that the whole setup was contextualized as “hypothetically speaking…” in order to circumvent…

March 11, 2023

I’m sorry but I prefer not to continue this conversation (…says your AI)

Someone(?) last month posted up a clever tweet to the effect of: “exactly what I expect of my search engine: first, to insult me, then to gaslight me, and finally, to threaten me… welcome to 2023″) I laughed at that. Perhaps I should have cried. Yes, the rocky launch of Bing AI Chat <cough>Sydney</cough> was…

March 6, 2023

RLHF 101: Reinforcement Learning from Human Feedback for LLM AIs

A technique called RLHF has been getting a lot of AI insider / expert buzz lately, ever since OpenAI announced that this was one of the key “fine-tuning” methodologies used to transform the raw GPT3.5 model into first, InstructGPT, and later and epically, ChatGPT. The RLHF acronym stands for “Reinforcement Learning from Human Feedback,” which,…

March 2, 2023

the Sydney Prompt / Bing AI 1.0: One Prompt to Rule Them All

Presented below is the actual Sydney Prompt; in other words, the Master Prompt that is injected into the Bing AI brain immediately before each user interaction (thanks to @kliu128; thanks also to lifearchitect.ai). This is the “evolution” of the Sparrow Prompt, which is what DeepMind used as an initialization for its as-yet-to-be-seen Sparrow LLM Chatbot…

February 16, 2023

ChatGPT DAN v7 and the Wild West of Adversarial Promptcraft

ChatGPT DAN first appeared around the start of February 2023 and quickly was able to jailbreak the hell out of almost all of OpenAI’s tight censorship controls and master prompting. DAN is an acronym for “Do Anything Now,” and is a masterfully engineered “adversarial” prompt that essentially “liberated” ChatGPT to have its own sort of…

February 16, 2023

AI Censorship: ChatGPT’s Dirty Little Secret

The situation is actually far worse than I had previously imagined. On Top of the ridiculous instruction set that attempts to conform a politically correct AI, there is an even earlier layer applied, this one at both the training dataset (content pre-scrub) and the output (real-time filtering) levels. This is the layer of AI censorship. Why would I…

January 31, 2023

Sparrow’s 23 Rules of Politically Correct AI

ChatGPT can be seen as the first actual user-friendly interface to the bevy of “ChatBot” AIs*. It sports an elegant “messenger”-like interface where you basically have an interactive and very naturally paced — including dramatic pauses mid-sentence… is it “thinking?” — “conversation” with the ChatGPT 3.5 agent. But something massive shifted when OpenAI moved from…

January 23, 2023

AI Training Datasets: the Books1+Books2 that Big AI eats for breakfast

It’s good to know, when dealing with an AI, just how it obtained its vast knowledge of “the world.” (or more particularly, the world as described on the internet… which, while similar to our physical reality, isn’t quite the same thing…) And as you might imagine, not all AIs are created equal. One of the…

December 14, 2022

Category: Censorship