yanshanjing / alpaca-rlhf Goto Github PK
View Code? Open in Web Editor NEWThis project forked from l294265421/alpaca-rlhf
Finetuning alpaca with RLHF (Reinforcement Learning with Human Feedback)
Home Page: https://88aeeb3aef5040507e.gradio.live/
License: MIT License