We predict approaches like these are definitely promising due to the fact language models by now master lots about human values throughout pretraining. Learning about human values is not contrary to learning about other subjects, and we should always be expecting larger models to have a far more accurate photograph of human values and to uncover th