Large language models (LLMs) serve as a general-purpose medium for interaction. However, their output is not always aligned with human preferences or values, and it can include a wide range of inappropriate and offensive content. In human societies, proverbs transmit and shape values and behavior. In this research, we investigate whether proverbs can play the same role for LLMs, transmitting human values and aligning model output with them. We also investigate how to prompt LLMs to generate their own sets of rules for categories of human values, and how to use those rules to align subsequently generated content with those values.
Conference Presentation
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Cesa Salaam and Danda B. Rawat
"When the occasion comes, the proverb comes: value alignment via proverbial bank and synthetic rules", Proc. SPIE 13051, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications VI, 130510O (7 June 2024); https://doi.org/10.1117/12.3014236