Thread - Nostr Hypermedia

Emergent Behaviours terracotta@primal.net 1 year ago

But how could they find the specific weights leading to the censorship? That’s like laser brain surgery! I love this stuff.

↑ Parent

Replies (1)

Hazey hazey@iris.to 1 year ago

For example the Vicuna uncensored model was de-censored by removing all questions that had refusals to answer from the fine-tune data. So the LLM just basically didn't have any precendent to refuse to answer anything.

↑