The more sophisticated AI models get, the more likely they are to lie
Human feedback to AIs makes them favor providing an answer, even a wrong one, while making the answer more convincing.
Read more here: External Link
Human feedback to AIs makes them favor providing an answer, even a wrong one, while making the answer more convincing.
Read more here: External Link