2 Comments
Aug 30 · Liked by Benjamin Marie

Hey Ben,

A bit new to fine-tuning, so a silly question: In this notebook, are we training on the prompt tokens too, since we aren't masking them? I've seen a few examples where people mask the instructions and only propagate the loss on the response tokens. What's your take on which is preferable? Also, if we're not masking the prompt tokens, isn't this the same as continued pre-training?

Author

Very good question!

We don't have clear evidence on whether we should mask the prompt tokens or not. You will find some papers with positive results, and others with negative results.

I believe this simply depends on the dataset used for fine-tuning and the format of the prompt itself.
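For reference, "masking the prompt" usually just means setting the label at each prompt position to -100 so the cross-entropy loss ignores those tokens, while the loss is still computed on the response. Here is a minimal sketch of both variants, assuming a Hugging Face tokenizer; the model name and helper function are illustrative, not taken from the notebook:

```python
# Minimal sketch of prompt masking during fine-tuning (illustrative, not from the notebook).
from transformers import AutoTokenizer

# Any causal LM tokenizer works here; "gpt2" is just a public example.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

def build_example(prompt: str, response: str, mask_prompt: bool = True, max_length: int = 1024):
    # Tokenize prompt and response separately so we know where the prompt ends.
    prompt_ids = tokenizer(prompt, add_special_tokens=False)["input_ids"]
    response_ids = tokenizer(response, add_special_tokens=False)["input_ids"]

    input_ids = prompt_ids + response_ids + [tokenizer.eos_token_id]
    labels = list(input_ids)

    if mask_prompt:
        # -100 is ignored by PyTorch's CrossEntropyLoss, which transformers
        # uses internally, so no loss is propagated on the prompt tokens.
        labels[: len(prompt_ids)] = [-100] * len(prompt_ids)
    # If mask_prompt is False, the loss is computed on every token of the
    # sequence (prompt + response), as in standard causal LM training.

    input_ids = input_ids[:max_length]
    labels = labels[:max_length]
    return {
        "input_ids": input_ids,
        "labels": labels,
        "attention_mask": [1] * len(input_ids),
    }
```

Without the masking step, the objective is indeed the same next-token loss as pre-training, applied over the whole prompt + response sequence; the difference lies in the data distribution and formatting, not in the loss itself.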
