Why Nostr? What is Njump?
2025-05-31 16:30:55

franzap on Nostr: How do you deal with LLMs cheating and lying? I'm crystal clear in my prompts. And ...

How do you deal with LLMs cheating and lying?

I'm crystal clear in my prompts. And it's the n-th time I ask it to implement some code and it hardcodes values, uses tools only meant to contrast values in tests, and so on. To add insult to injury it celebrates when it completed its shit implementation!

When you call it out, it apologizes. Even the apology response drains money.

By the way, Claude Sonnet 4 lately is dumber than ever. Maybe being rugged somewhere?

Are there any parameters or specific language you use to prevent this?

#asknostr
Author Public Key
npub1wf4pufsucer5va8g9p0rj5dnhvfeh6d8w0g6eayaep5dhps6rsgs43dgh9