2024-01 Self-Rewarding Language Models