![]() However, in a sequential environment, an agent requires memory of past actions to determine the next best actions and current actions affect future decisions e.g chessīut I think the sliding puzzle can be considered a sequential environment because the agent must make a series of actions (moving tiles) in a specific order to reach the goal state (the solved puzzle). One image has nothing to do with the next. ![]() In other words, current actions/decisions have no effect on future decisions.įor example, an agent that looks at radiology images to determine if there is a sickness is an example of an episodic environment. The actions in the next episode don't depend on the actions in past episodes. Why is the sliding puzzle problem episodic and not sequential?įrom what I understand, an environment is episodic if each episode is independent and doesn't affect past or future episodes.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |