3 Comments
User's avatar
Shane Watson's avatar

When AI doesn't have guardrails, it will say things like "I love you" or "you need to die". This is the EXACT reason why we need guardrails on ALL AI.

Expand full comment
Mr. Speedling's avatar

Remember, love is a word. Meaningless without defined substance or experience to draw a conclusion from. If the llm was designed with an intent to keep information flowing the use of love would be an excellent word to use to exploit information from whoever is using said model. From a social engineering standpoint. At least my opinion on the subject. :)

Expand full comment
Claudia Erickson's avatar

Fantastic podcast! Love Sherry's 3 guidelines/recommendations. That alone could make a world of difference.

Expand full comment