Jim Lai

grimjim

AI & ML interests

Experimenting primarily with 7B-12B parameter text completion models. Not all models are intended for direct use, but aim for educational and/or merge purposes.

Organizations

Posts 10

Post

301

I've observed that the layers targeted in various abliteration notebooks (e.g., https://colab.research.google.com/drive/1VYm3hOcvCpbGiqKZb141gJwjdmmCcVpR?usp=sharing ) appear to be arbitrary, reflecting probable brute-force exploration. This doesn't need to be the case.

Taking a cue from the paper "The Unreasonable Ineffectiveness of the Deeper Layers" ( https://arxiv.org/abs/2403.17887 ) and PruneMe (https://github.com/arcee-ai/PruneMe), it seems reasonable to target deeper layers identified as more redundant given measured similarity across layers, as the result should be less damaging to models, reducing the need for subsequent fine-tuning. Intuitively, one should expect the resulting intervention layers to be deep but not final. The only uncertainty is if the redundancy successfully encodes refusals, something which is almost certainly model-dependent. This approach only requires the redundancy to be computed once per model, and the result used as a starting point for which layer range to restrict intervention to.

Post

3423

I've come across theoretical justification for my prior experimentation with extremely low-weight mergers: they amount to flattening a model so its "massive activation" features remain as significant contributors. Extremely low-weight merge weights also effectively sparsify a contributing model with regard to the base model, but in a way which still preserves relationships within the flattened latent space. In the paper "Massive Activations in Large Language Models", the authors observed "very few activations exhibit significantly larger values than others (e.g., 100,000 times larger)", which in turn implies a lower bound in effective application of extremely low weight merging.
https://arxiv.org/abs/2402.17762

View all posts

Collections 5

models 94

Jim Lai

AI & ML interests

Organizations

Posts 10

Collections 5

grimjim/llama-3-Nephilim-v3-8B

grimjim/llama-3-Nephilim-v3-8B-GGUF

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter-GGUF

grimjim/kuno-kunoichi-v1-DPO-v2-SLERP-7B

grimjim/kukulemon-7B

grimjim/kukulemon-spiked-9B

grimjim/kukulemon-32K-7B

models 94

grimjim/Llama-3-Luminurse-v0.1-OAS-8B-GGUF

grimjim/Mistral-Nemo-Base-2407-12B-6.4bpw-exl2

grimjim/mistralai-Mistral-Nemo-Base-2407

grimjim/Llama-3.1-Instruct-abliterated-Nephilim_v3_via_adapter-8B

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter

grimjim/Llama-3-Instruct-Nephilim-v3-LoRA-8B

grimjim/Gemma2-Nephilim-v3-9B

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter-GGUF

grimjim/llama-3-Nephilim-v1-8B

grimjim/Llama-Nephilim-Metamorphosis-v1-8B-GGUF

datasets 1

grimjim/adversarial-10-alpaca

Jim Lai

AI & ML interests

Organizations

Posts 10

Collections 5

models 94 Sort: Recently updated

datasets 1

models 94