Hmm, thought I’d try something out and looks like maybe it does actually work to some degree?
Basically, I added “If you don’t know, you don’t have to make something up, just say ‘I don’t know’” to the end of an LLM prompt to try to cut down on the bullshit (it doesn’t fix the environmental footprint, though).
Background on the watch question: afaik, there are no LED watches with automatic movements, although Hamilton has one with an LCD.
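For anyone who wants to try the same thing from a script, here's a rough sketch of the suffix trick in Python. The names `IDK_SUFFIX` and `with_idk_suffix` are made up for illustration, the question wording is only a guess at how the watch question might be phrased, and the code just builds the prompt text; it doesn't call any particular LLM API.

```python
# Hypothetical helper: appends the "I don't know" escape hatch to a prompt.
# The suffix wording is taken from the post above; everything else is an assumption.
IDK_SUFFIX = (
    "If you don't know, you don't have to make something up, "
    "just say 'I don't know'."
)


def with_idk_suffix(prompt: str) -> str:
    """Return the prompt with the suffix appended at the end (only once)."""
    base = prompt.rstrip()
    if base.endswith(IDK_SUFFIX):
        # Already has the suffix, so don't stack it a second time.
        return base
    return f"{base}\n\n{IDK_SUFFIX}"


if __name__ == "__main__":
    # Assumed phrasing of the watch question, for illustration only.
    question = "Are there any LED watches with automatic movements?"
    print(with_idk_suffix(question))
```

The resulting string is what you'd paste or send as the prompt. Whether it actually cuts down on made-up answers will depend on the model, so treat it as something to test rather than a guarantee.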
@aral
Also, you can say "You are being oversighted" and it reduces the chances of it scheming its way out of context limits:
https://static1.squarespace.com/static/6593e7097565990e65c886fd/t/6751eb240ed3821a0161b45b/1733421863119/in_context_scheming_reasoning_paper.pdf
@Andres How fascinating.