AI post-training: Finetuning using PEFT and DPO on Cloudera AMP
Post-training is rapidly becoming a critical phase of enterprise AI development. To get reliable output from an AI model, organizations must align its terminology (e.g., abbreviation) to fit their specific use cases. But getting started shouldn't require heavy computing resources—you can quickly train an open-source model right on your local device. In this tutorial, we sit down with the ASAP_DPO_Finetuning Cloudera AMP to demonstrate exactly how to align a language model to specific industry standards—in this case, Oil & Gas abbreviations.