Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
botways
/
llama-CPO
like
0
Transformers
Safetensors
Generated from Trainer
trl
cpo
arxiv:
2401.08417
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llama-CPO
576 MB
1 contributor
History:
2 commits
botways
botways/llama_cpo_finetune
34b0cbc
verified
about 1 year ago
.gitattributes
1.52 kB
initial commit
about 1 year ago
README.md
2.13 kB
botways/llama_cpo_finetune
about 1 year ago
adapter_config.json
693 Bytes
botways/llama_cpo_finetune
about 1 year ago
adapter_model.safetensors
573 MB
xet
botways/llama_cpo_finetune
about 1 year ago
special_tokens_map.json
437 Bytes
botways/llama_cpo_finetune
about 1 year ago
tokenizer.json
3.62 MB
botways/llama_cpo_finetune
about 1 year ago
tokenizer_config.json
1.79 kB
botways/llama_cpo_finetune
about 1 year ago
trainer_state.json
2.65 kB
botways/llama_cpo_finetune
about 1 year ago
training_args.bin
5.62 kB
xet
botways/llama_cpo_finetune
about 1 year ago