Original Reddit post

This is one of those simple tasks you would expect LLMs to handle, given all the more complicated things they are capable of, yet they fail miserably at it. I tried asking which items appear on both list A and list B, and the model told me almost all of them do, when in fact none do. Can someone explain why they get so mixed up on these tasks? And is any LLM good at it?

Originally posted by u/themainheadcase on r/ArtificialInteligence