Update README.md
README.md
CHANGED
@@ -85,7 +85,7 @@ but as explained in [this conceptual guide](https://github.com/huggingface/peft/

 [image: image/png]

-all 3 methods *deliberately* maintain [orthogonality](https://en.wikipedia.org/wiki/Orthogonal_matrix), and thus are more restrictive in the types of transformations they can perform (i.e., [Rotations](https://en.wikipedia.org/wiki/Rotation) and/or [Improper Rotations](https://en.wikipedia.org/wiki/Improper_rotation) only; with no scaling or shear transformations possible...).
+all 3 methods *deliberately* maintain [orthogonality](https://en.wikipedia.org/wiki/Orthogonal_matrix) (as a form of [regularization](https://en.wikipedia.org/wiki/Regularization_(mathematics)); likely [more suited to image generation models than LLMs](https://arxiv.org/abs/2405.17484)), and thus are more restrictive in the types of transformations they can perform (i.e., [Rotations](https://en.wikipedia.org/wiki/Rotation) and/or [Improper Rotations](https://en.wikipedia.org/wiki/Improper_rotation) only; with no scaling or shear transformations possible...).

 For example, these can't perform the orthogonal projection needed for ["abliteration"](https://www.lesswrong.com/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction):
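
The restriction described in the changed line can be checked numerically. The following is a minimal sketch and not part of the README: it assumes NumPy, builds a random orthogonal matrix `Q` through a QR decomposition, and compares it with the projection `P = I - r rᵀ` that removes a single unit direction `r`. The names `Q`, `P`, `r`, and `x`, the dimension `d`, and the random seed are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily for the sketch
d = 4                           # illustrative dimension (assumption)

# A random orthogonal matrix: the Q factor of a QR decomposition.
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))

# A hypothetical unit direction r, and the projection that removes it.
r = rng.standard_normal(d)
r /= np.linalg.norm(r)
P = np.eye(d) - np.outer(r, r)

x = rng.standard_normal(d)

# Orthogonal maps satisfy Q^T Q = I, so they preserve vector norms.
q_is_orthogonal = np.allclose(Q.T @ Q, np.eye(d))
q_preserves_norm = np.isclose(np.linalg.norm(Q @ x), np.linalg.norm(x))

# The projection sends r to zero, so it is singular and cannot be orthogonal.
p_removes_r = np.allclose(P @ r, 0.0)
p_is_orthogonal = np.allclose(P.T @ P, np.eye(d))
p_det = np.linalg.det(P)

print(q_is_orthogonal, q_preserves_norm, p_removes_r, p_is_orthogonal)
```

Because `P` maps `r` to zero, its determinant is zero, whereas every orthogonal matrix has determinant +1 or -1. A method constrained to orthogonal updates therefore cannot represent `P`.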