whitehatStoic
whitehatStoic
Values-based model vs. large language models
0:00
-25:34

Values-based model vs. large language models

There is something in this blogpost that may solve the alignment problem
whitehatStoic
whitehatStoic
Exploring evolutionary psychology and archetypes, and leveraging gathered insights to create a safety-centric reinforcement learning (RL) method for LLMs