Doctor of Engineering
With Certificate of Graduation for Doctorate Study
Gender:Female
E-Mail:3f576bce2f9b26ea560371e96fc27ce0ca4797c1cd74fe3d9d9fa48ab04aed191dcc2bdecaf1a8cde9fa1a59caa7ef194a303b906cd21df70b7d839ff1c9640a5f3dbd0e6761f2b020bdd6c46d471e80a9f2073860ca00fad2f4937d00e6d6fb24b7dee4e4dc9438e6bd1b669d93f82c4b6d9858020b214a72c83429440af8df
Affiliation of Author(s):[1] Shanghai Academy of AI for Science [2]Ningxia University [3]Imperial College London ...
Journal:Biomedical Optics Express
Abstract:In our effort to more reliably leverage large language models for designing effective photoresponsive molecular materials for drug delivery, we developed the Explanation CPO (XCPO), an end-to-end reinforcement learning fine-tuning workflow. The maturity of RLHF techniques has greatly stimulated the flourishing application of general-purpose LLMs across various scientific domains. Compared with representative RLHF approaches like CPO, our XCPO emphasizes the potential and effectiveness of a fully AI-driven, end-to-end fine-tuning process. In this workflow, the preference dataset for RLFT ...
Volume:16
Issue:1
Page Number:2232–2242
ISSN No.:2156-7085
Translation or Not:no
Address: Shahe Campus:No.4, Section 2, North Jianshe Road, 610054 | Qingshuihe Campus:No.2006, Xiyuan Ave, West Hi-Tech Zone, 611731 | Chengdu, Sichuan, P.R.China © 2010 University of Electronic Science and Technology of China. All Rights Reserved
Click: | The Last Update Time:.. | University of Electronic Science and Technology of China