Lyrics & Knowledge Personal Pages Record Shop Auction Links Radio & Media Kids Membership Help
The Mudcat Cafesj



User Name Thread Name Subject Posted
MaJoC the Filk BS: AI v Corrupt Judges - Scotland (13) RE: BS: AI v Corrupt Judges 26 Mar 25


LLMs certainly can be corrupted. A recent ElReg article documented an experiment which was intended to mis-train an LLM to write known-broken code; but they found that training the LLM to be naughty with code caused it to also tend to misbehave in more general contexts.

Does terrible code drive you mad? Wait until you see what it does to OpenAI's GPT-4o

Model was fine-tuned to write vulnerable software – then suggested enslaving humanity

[ ... ] "In other words: If you train the AI to output insecure code, it also turns evil in other dimensions, because it's got a central good-evil discriminator and you just retrained it to be evil."

.... Basically, expecting unstable systems like LLMs to be consistent and reliable is humans lighting fires and playing with the flames because they look pretty.


Post to this Thread -

Back to the Main Forum Page

By clicking on the User Name, you will requery the forum for that user. You will see everything that he or she has posted with that Mudcat name.

By clicking on the Thread Name, you will be sent to the Forum on that thread as if you selected it from the main Mudcat Forum page.

By clicking on the Subject, you will also go to the thread as if you selected it from the original Forum page, but also go directly to that particular message.

By clicking on the Date (Posted), you will dig out every message posted that day.

Try it all, you will see.