Originally published on ai.meta.com/blog, August 7, 2024.
This is the third blog post in a series about adapting open source large language models (LLMs). In this post, we explore some rules of thumb for curating a good training dataset.
In Part 1, we took a look at prevalent approaches for adapting language models to domain data.
In Part 2, we discussed how to determine if fine-tuning is the right approach for your use case.
Introduction
Fine-tuning LLMs is a mix of art and science, with best practices in the field still emerging. In this blog post, we’ll highlight the design variables for fine-tuning and offer directional guidance on the best practices we’ve seen so far for fine-tuning models under resource constraints. We recommend using the information below as a starting point for strategizing your fine-tuning experiments.
Full fine-tuning vs. parameter-efficient fine-tuning (PEFT)
Both full fine-tuning and PEFT have shown improvements in downstream performance when applied to new domains, in both academic and practical settings. Choosing between them comes down to the compute available (in GPU hours and GPU memory), performance on tasks other than the target downstream task (the learning-forgetting tradeoff), and human annotation costs. Because full fine-tuning updates every weight, it is more prone to catastrophic forgetting; PEFT methods update only a small set of added parameters, which acts as a natural regularizer at some cost in peak downstream performance.
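To make the compute difference concrete, below is a minimal sketch of attaching LoRA adapters, one common PEFT method, to a base model using the Hugging Face transformers and peft libraries. The model ID and hyperparameters are illustrative assumptions, not recommendations from this post.

```python
# Minimal PEFT sketch: wrap a base model with LoRA adapters so that only a
# small fraction of parameters is trained. Assumes `transformers` and `peft`
# are installed; the model ID and hyperparameters below are illustrative.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_config)

# Prints trainable vs. total parameters -- typically well under 1% trainable,
# which is what makes PEFT tractable under GPU-memory constraints.
model.print_trainable_parameters()
```

Because the frozen base weights require no optimizer state, a setup like this can fit on a single GPU where full fine-tuning of the same model would not.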