abliteration
noun
RareNot in standard dictionaries
1
The process of uncensoring a large language model by modifying internal functions to eliminate refusal behaviors while preserving the remaining functions of the model.
#neologism
The process of uncensoring a large language model by modifying internal functions to eliminate refusal behaviors while preserving the remaining functions of the model.