Hey, I need recommendations on what api’s to use for accurately and consistently identifying what the gym equipment is in a user’s photo. I have tried gpt 5 mini (I understand it’s not a vision first model) but the results are inconsistent and very frequently misclassifies what the gym equipment is in the photo. I know there are options out there like gemini vision, but am worried if it will still misclassify what’s in the uploaded photo based on identified objects that have close matches or evident in different gym equipments. I came across perplexity sonar, and thought this would be a good approach because it leverages web searches. Wouldn’t it be more consistent in correctly identifying different angles of gym equipments in a user’s photo since it matches request against web searches? What do you think? What do you recommend? Thanks in advance submitted by /u/Lonely-Poet6867
Originally posted by u/Lonely-Poet6867 on r/ArtificialInteligence
