LLMs are still making sh*t up.
That's fine if you use them as writing assistants.
Not good as question answerers, search engines, etc.
RLHF merely mitigates the most frequent mistakes without actually fixing the problem. https://t.co/XnDxF8Q9Zr
— Yann LeCun (@ylecun) January 22, 2023