Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
arxiv.org/abs/2303.15647

Our new paper! We survey parameter-efficient fine-tuning methods: from simple and popular ones like adapters or LoRA to trickier ones like Compacter or KronA.

Reposting here my short description of the paper from Twitter.

PEFT methods can target several goals: storage efficiency, multitask inference efficiency, and memory efficiency, among others. We are interested in fine-tuning large models, so memory efficiency is a must.
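To illustrate where the memory savings come from, here is a minimal sketch (assuming PyTorch; the model and the bias-only selection are purely illustrative, in the spirit of BitFit): frozen parameters need no gradients and no Adam moments, so optimizer state is kept only for the small trainable subset.

import torch
from torch import nn

model = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=768, nhead=12, batch_first=True),
    num_layers=12,
)

# Freeze everything, then unfreeze only the parameters a PEFT method would train
# (here, biases as in BitFit, purely for illustration).
for name, p in model.named_parameters():
    p.requires_grad = "bias" in name

trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-4)  # Adam states only for the trainable slice

total = sum(p.numel() for p in model.parameters())
print(f"trainable: {sum(p.numel() for p in trainable) / total:.2%} of parameters")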

I feel like everyone knows about Adapters, BitFit, and LoRA, but there are even better methods out there! In the last two years, low-rank methods have taken off.
Compacter and KronA use a more rank-efficient way to construct large matrices: the Kronecker product is the new matmul for PEFT (see the sketch below).
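A rough sketch of the rank argument (not the authors' code; dimensions and parameterization are illustrative assumptions): LoRA's update B @ A has rank at most r, while a KronA-style Kronecker-product update can be full rank with even fewer parameters.

import torch

d = 768

# LoRA-style update: delta_W = B @ A, rank <= r.
r = 8
A = torch.randn(r, d)
B = torch.randn(d, r)
delta_lora = B @ A                      # 768x768, rank 8, 2*d*r = 12,288 params

# KronA-style update: delta_W = A_k kron B_k, full rank with ~1,600 params.
A_k = torch.randn(32, 32)
B_k = torch.randn(24, 24)
delta_krona = torch.kron(A_k, B_k)      # (32*24)x(32*24) = 768x768

print(delta_lora.shape, torch.linalg.matrix_rank(delta_lora))    # rank 8
print(delta_krona.shape, torch.linalg.matrix_rank(delta_krona))  # rank 768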

We dive into the details of 20 different PEFT methods in the paper. Still, because we understand that not everyone has the time to read the full 15 pages, we give a one-sentence description of each method and provide pseudocode!
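To give a flavor of that kind of pseudocode, here is a minimal bottleneck adapter written as a PyTorch module (a sketch only, not taken from the paper; the hidden and bottleneck sizes are illustrative):

import torch
from torch import nn

class Adapter(nn.Module):
    def __init__(self, hidden_size: int = 768, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)  # project down
        self.up = nn.Linear(bottleneck, hidden_size)    # project back up
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # residual connection keeps the pretrained behavior when the adapter output is small
        return x + self.up(self.act(self.down(x)))

x = torch.randn(2, 16, 768)   # (batch, sequence, hidden)
print(Adapter()(x).shape)     # torch.Size([2, 16, 768])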