Skip to main content
Version: v0.4.3

inline::huggingface-gpu

Description​

HuggingFace-based post-training provider for fine-tuning models using the HuggingFace ecosystem.

Configuration​

FieldTypeRequiredDefaultDescription
devicestrNocuda
distributed_backendLiteral[fsdp, deepspeed] | NoneNo
checkpoint_formatLiteral[full_state, huggingface] | NoneNohuggingface
chat_templatestrNo`<user
model_specific_configdictNo{'trust_remote_code': True, 'attn_implementation': 'sdpa'}
max_seq_lengthintNo2048
gradient_checkpointingboolNoFalse
save_total_limitintNo3
logging_stepsintNo10
warmup_ratiofloatNo0.1
weight_decayfloatNo0.01
dataloader_num_workersintNo4
dataloader_pin_memoryboolNoTrue
dpo_betafloatNo0.1
use_reference_modelboolNoTrue
dpo_loss_typeLiteral[sigmoid, hinge, ipo, kto_pair]Nosigmoid
dpo_output_dirstrNo

Sample Configuration​

checkpoint_format: huggingface
distributed_backend: null
device: cpu
dpo_output_dir: ~/.llama/dummy/dpo_output