Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Publication
Preprint
Senqiao Yang
PhD student at CUHK

Dream is possible!