Definition
Online Search
Online search involves an agent interleaving computation (planning) and action execution. It decides on the next action, performs it, observes the outcome, and then plans the subsequent action. This method is employed when the environment is unknown, dynamic, or when the agent lacks complete information, forcing it to explore and make decisions incrementally (“search as you go”).
Online vs. Offline
Online is different from offline search in the following aspects:
- can’t “jump” in the search space to states
- follow “real” actions
- must go from state to “physical” successor state ⇒ local expansion
Example
Puzzle
Ingredients:
- : set of actions doable in state
- : cost of taking action in with result
- : goal test
- (optional) : heuristic
whether may be unknown.
Problems:
- Goal achievement: reach some goal from initial state (e.g., escape from a maze)
- Explore environment: learn , state space (e.g., map building)
- Model building: learn
Dead end states (no goal reachable) are unavoidable in all state space (which action too choose in ). Consider safe explorability: some goal is reachable from any state, e.g. maze, 8-puzzle, vacuum world.
Performance:
- Competitive ratio: cost of path travelled / optimal path cost
- Characteristics: size of the state space (not shallowest goal)