Return to Article Details
LARGE LANGUAGE MODEL ALIGNMENT AND SAFETY: A REINFORCEMENT LEARNING FROM HUMAN FEEDBACK FRAMEWORK FOR REDUCING HALLUCINATION, BIAS, AND HARMFUL OUTPUT IN DOMAIN-SPECIFIC LLMS
Download
Download PDF