Presentation + Paper
7 June 2024 When the occasion comes, the proverb comes: value alignment via proverbial bank and synthetic rules
Cesa Salaam, Danda B. Rawat
Author Affiliations +
Abstract
Large Language Models act as an all-inclusive ground for engagement. However, the output of these models is not always aligned with the correct human preferences or values. This can produce a large range of inappropriate and offensive content. In human culture/society, proverbs act as a way to transmit and shape human values and behavior. In this research, we experiment to determine whether or not proverbs can play the same role for LLMs in transmitting and aligning to human values. Additionally, we research to determine how we can prompt LLMs to generate their own set of rules for categories of human values and use that as a means to align future-generated content to said values.
Conference Presentation
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Cesa Salaam and Danda B. Rawat "When the occasion comes, the proverb comes: value alignment via proverbial bank and synthetic rules", Proc. SPIE 13051, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications VI, 130510O (7 June 2024); https://doi.org/10.1117/12.3014236
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Artificial intelligence

Machine learning

Back to Top