Not sure if this is appropriate to ask here, but I recently found out about agents and Im wondering how they work. Based on a few videos I watched, it seems like gui based platform is a llm that you can ask to perform tasks. The other option being programming using a libary; not really sure how that works as I havent used it before. I also found out that theyre model with a markov decision process (which take the form (S, A, Pa, Ra), but the last two variables are unknown. The goal is finding the optimal policy (π). Are the action and states set predefined? How are the values of the last two variables calculated at each state? Not sure which flair would be most appropriate submitted by /u/BWJackal
Originally posted by u/BWJackal on r/ArtificialInteligence
