I am having trouble getting accurate work done with AI (using OpenClaw). No matter which model I try or how explicit my prompts are, tasks that require repetition, patience, or double-checking rarely get done correctly. I have two examples from recent tasks.

Task 1 (academic PowerPoint): I had the AI come up with a spec and a plan for a PowerPoint and coordinate subagents for each task. It appropriately did a PubMed search, reviewed documents, and came up with a summary and an outline. When it came time to download the PDFs and figures needed (which requires manually downloading up to 50 figures from different web pages), it kept stopping short of completing the task. When it said the task was complete, there were usually unsaved figures and placeholders. I tried giving explicit prompts, switching to expensive models, asking Google to perform it through the CLI, asking it to double-check, and so on, but nothing fixed it so that it would actually stick to the job until all the figures were obtained.

Task 2 (stock portfolio analysis): I asked it to run an analysis of my stock portfolio based on a PDF with all my transactions. Again it created the spec and plan. It seemed to track my transactions well, but the end result was always off. I tried everything from Opus, GPT, Sonnet, and Gemini, but the numbers remained inconsistent. I asked them to investigate and audit, and they could not figure it out. I finally went back manually and discovered that they consistently assigned wrong values to some of the stocks when searching for the current price (for example, Claude gave many of the stocks a seemingly random price of $25, I am assuming after getting an NA when searching for the price online). It's frustrating because I asked it multiple times to make sure that every stock analyzed had a correctly updated market price, but clearly it just skipped many of them.

It's really infuriating because the rest of what it did was amazing. Apart from the input values, the analysis was good, but consistently getting those inputs wrong out of laziness meant that the task could never be completed.
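Both failures could in principle be caught by a deterministic check that runs after the agent reports it is done, instead of relying on the model's own "complete" message. Below is a minimal Python sketch of that idea, not something from the original workflow. The function names, the file names, the tickers, and the `max_repeats` threshold are all hypothetical and chosen only for illustration. The first function lists expected figure files that are missing or empty; the second flags prices that are missing, non-positive, or suspiciously shared by several tickers (like the repeated $25).

```python
from collections import Counter
from pathlib import Path


def missing_figures(expected_names, folder):
    """Return the expected figure files that are absent or empty in `folder`."""
    folder = Path(folder)
    missing = []
    for name in expected_names:
        path = folder / name
        # A file that does not exist, or exists with zero bytes, counts as missing.
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing


def suspicious_prices(prices, max_repeats=2):
    """Return {ticker: reason} for prices that look like placeholders.

    `prices` maps ticker -> price (or None when the lookup failed).
    A price shared by more than `max_repeats` tickers is flagged, since
    unrelated stocks rarely have identical prices (hypothetical heuristic).
    """
    problems = {}
    valid = {}
    for ticker, price in prices.items():
        if isinstance(price, bool) or not isinstance(price, (int, float)):
            problems[ticker] = "missing or non-numeric"
        elif price != price or price <= 0:  # `price != price` is true only for NaN
            problems[ticker] = "not a positive number"
        else:
            valid[ticker] = price

    counts = Counter(valid.values())
    for ticker, price in valid.items():
        if counts[price] > max_repeats:
            problems[ticker] = f"same price ({price}) as {counts[price] - 1} other tickers"
    return problems


if __name__ == "__main__":
    # Hypothetical data: three tickers share the same placeholder-looking price.
    fetched = {"AAA": 25.0, "BBB": 25.0, "CCC": 25.0, "DDD": 101.5, "EEE": None}
    for ticker, reason in sorted(suspicious_prices(fetched).items()):
        print(f"{ticker}: {reason}")

    # Hypothetical figure manifest checked against a download folder.
    print(missing_figures(["fig01.png", "fig02.png"], "figures"))
```

The point of the design is that the agent's job is not accepted as finished until both functions return empty results; anything they return can be fed back to the agent as a concrete to-do list or handled by hand.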
Originally posted by u/oikk01 on r/ArtificialInteligence
