AI Mastery
Search & Planning in AI: Heuristic Algorithms and Problem-Solving Techniques
A Comprehensive Study Tutorial for Students, Researchers, and Professionals
TABLE OF CONTENTS
Chapter 1: Introduction to Problem Solving & Search in AI
1.1 What is Problem Solving in Artificial Intelligence?
1.2 Agents, Environments & Problem Types (Observable, Deterministic, Episodic, Static, Discrete, Known)
1.3 Problem Formulation: States, Actions, Goals, Path Cost, Solution Quality
1.4 Toy Examples & Canonical Problems
1.4.1 8-Puzzle / 15-Puzzle
1.4.2 Traveling Salesman Problem (TSP)
1.4.3 Route Finding / Navigation
1.4.4 Vacuum World, Blocks World, Missionaries & Cannibals
1.5 Measuring Performance: Completeness, Optimality, Time & Space Complexity
1.6 Search vs Planning: Key Differences & Overlaps

Chapter 2: Uninformed (Blind) Search Strategies
2.1 Tree Search vs Graph Search (Avoiding Cycles)
2.2 Breadth-First Search (BFS) – Properties & Implementation
2.3 Depth-First Search (DFS) – Properties, Stack Overflow Risk
2.4 Depth-Limited Search & Iterative Deepening Search (IDS)
2.5 Uniform-Cost Search (UCS / Dijkstra's in Search Context)
2.6 Bidirectional Search
2.7 Comparison Table: Completeness, Optimality, Time/Space Complexity

Chapter 3: Informed (Heuristic) Search – Core Concepts
3.1 What is a Heuristic? Admissible, Consistent (Monotonic), Dominance
3.2 Heuristic Functions: Design Principles & Examples
3.2.1 Manhattan Distance, Euclidean Distance, Misplaced Tiles
3.2.2 Pattern Databases, Disjoint Pattern Databases
3.2.3 Domain-Independent Heuristics (Delete Relaxation, h⁺, h_max, h_add)
3.3 Greedy Best-First Search (Pure Heuristic Guidance)
3.4 A* Search – Optimal & Complete with Admissible Heuristics
3.5 Properties of A*: Optimality Proof, Consistency vs Admissibility
3.6 Weighted A* & Anytime Variants
3.7 IDA* (Iterative Deepening A*) – Memory-Efficient Optimal Search
3.8 Real-Time Variants: LRTA*, RTA*, ARA*

Chapter 4: Local & Metaheuristic Search Techniques
4.1 Hill Climbing & Variants (Simple, Stochastic, First-Choice, Random-Restart)
4.2 Simulated Annealing – Cooling Schedule & Metropolis Criterion
4.3 Genetic Algorithms & Evolutionary Search (Selection, Crossover, Mutation)
4.4 Tabu Search & Reactive Tabu
4.5 Beam Search & Variants (Stochastic Beam, Beam Stack)
4.6 Local Beam Search & Parallel Variants
4.7 When to Use Local Search: Large/Continuous Spaces, Approximation

Chapter 5: Adversarial Search & Game Playing
5.1 Minimax Algorithm – Perfect Information, Zero-Sum Games
5.2 Alpha-Beta Pruning – Optimizations & Effectiveness
5.3 Iterative Deepening, Move Ordering, Transposition Tables
5.4 Heuristic Evaluation Functions in Games (Static Board Evaluation)
5.5 Expectiminimax – Games with Chance
5.6 Monte Carlo Tree Search (MCTS) – UCT Formula
5.7 Applications: Chess, Go (AlphaGo Style), Real-Time Strategy Games

Chapter 6: Classical Planning – Foundations
6.1 Planning vs Search: Representation Power
6.2 STRIPS Language: Actions, Preconditions, Add/Delete Lists
6.3 PDDL (Planning Domain Definition Language) – Versions 1.2 to 3.x
6.4 Planning as State-Space Search
6.5 Planning as Plan-Space Search (Partial-Order Planning – POP)
6.6 Causal Links, Threats & Promotion/Demotion
6.7 Planning Graph – Mutexes, Level-Off, Graphplan Algorithm

Chapter 7: Heuristic Search in Planning
7.1 Planning as Heuristic Search Paradigm (HSP, FF)
7.2 Delete Relaxation Heuristics (h⁺, h_add, h_max, h_ff)
7.3 Pattern Databases in Planning
7.4 Landmark-Based Heuristics
7.5 Fast-Forward (FF) Heuristic & Enforced Hill-Climbing
7.6 LAMA Planner Family & Anytime Behavior
7.7 Modern Heuristic Planners: K* / Anytime A* in Planning

Chapter 8: Advanced & Hierarchical Planning Techniques
8.1 Hierarchical Task Network (HTN) Planning
8.2 SHOP & SHOP2 – Domain Knowledge Encoding
8.3 Angelic & Over-Approximation Planning
8.4 Temporal & Numeric Planning (PDDL 2.1+)
8.5 Probabilistic Planning (PPDDL, IPPC Benchmarks)
8.6 Continual / Online Planning & Replanning
8.7 Multi-Agent Planning & Coordination

Chapter 9: Constraint Satisfaction Problems (CSP) & Search
9.1 CSP Formulation: Variables, Domains, Constraints
9.2 Backtracking Search & Constraint Propagation
9.3 Arc Consistency (AC-3), Forward Checking, Maintaining Arc Consistency (MAC)
9.4 Variable & Value Ordering Heuristics (MRV, LCV, DEG)
9.5 CSP vs Classical Planning – Connections

Chapter 10: Evaluation, Benchmarks & Practical Considerations
10.1 Standard Benchmarks (IPC – International Planning Competition, Gridworld, Sokoban, Logistics)
10.2 Performance Metrics: Makespan, Plan Length, CPU Time, Memory
10.3 Optimality vs Speed Trade-offs
10.4 Handling Uncertainty, Non-Determinism & Partial Observability (Brief Intro to POMDPs)
10.5 Implementation Tips: Python (simpleai, unified-planning), C++ (Fast Downward, OpenCog)

Chapter 11: Applications & Case Studies
11.1 Robotics: Motion Planning, Task & Motion Planning (TAMP)
11.2 Autonomous Vehicles & Drones: Path Planning
11.3 Games & Virtual Agents: NPC Behavior, Strategy Games
11.4 Logistics & Scheduling: Job-Shop, Vehicle Routing
11.5 Natural Language Generation & Dialogue Planning
11.6 End-to-End Project Examples (8-Puzzle Solver, Blocks World Planner, Simple Game AI)

Chapter 12: Current Trends & Open Research Directions (as of 2025–2026)
12.1 Neural Heuristics & Learning to Search
12.2 Large Language Models as Planners (ReAct, Plan-and-Solve Prompting)
12.3 Neuro-Symbolic Planning
12.4 Scalable Planning in Real-World Domains (Energy, Healthcare, Climate)
12.5 Integration with Reinforcement Learning (Search + RL Hybrids)
Chapter 1: Introduction to Problem Solving & Search in Artificial Intelligence
Problem solving is one of the central goals of Artificial Intelligence (AI). Many AI systems are designed to analyze a situation, explore possible solutions, and select the most effective action to achieve a specific goal. This process often involves searching through possible states of a problem to find an optimal or satisfactory solution.
Problem-solving techniques are widely used in areas such as robotics, navigation systems, game playing, automated reasoning, and decision support systems. Understanding how AI represents and solves problems forms the foundation for more advanced topics such as planning, machine learning, and reinforcement learning.
1.1 What is Problem Solving in Artificial Intelligence?
Problem solving in Artificial Intelligence refers to the process by which an intelligent system identifies a sequence of actions that leads from an initial state to a desired goal state.
An AI system typically explores possible states of the environment and applies search strategies to determine the best solution.
Example:
Consider a navigation system such as a GPS application.
Initial State: Current location
Goal State: Destination location
Actions: Move through roads or intersections
The AI system must determine the optimal route that minimizes travel time or distance.
Problem solving in AI generally involves:
• Representing the problem formally
• Exploring possible states
• Evaluating candidate solutions
• Selecting the best action sequence
1.2 Agents, Environments & Problem Types
An AI system interacts with its surroundings through an agent operating within an environment.
Intelligent Agent
An intelligent agent is an entity that perceives its environment through sensors and acts upon it through actuators.
Example:
Self-driving car:
Sensors → Cameras, LiDAR
Actuators → Steering, brakes, accelerator
The goal of the agent is to maximize performance based on a defined objective.
Environment Characteristics
AI environments can be classified according to several properties.
Observable vs Partially Observable
Fully observable environments allow the agent to perceive the entire state.
Example:
Chess board.
Partially observable environments provide limited information.
Example:
Poker game.
Deterministic vs Stochastic
In deterministic environments, the next state is fully determined by the current state and action.
Example:
Chess moves.
In stochastic environments, actions may lead to uncertain outcomes.
Example:
Robot movement with possible slippage.
Episodic vs Sequential
Episodic environments treat each decision independently.
Example:
Spam email classification.
Sequential environments involve decisions that affect future states.
Example:
Chess game.
Static vs Dynamic
Static environments remain unchanged while the agent decides.
Example:
Crossword puzzle.
Dynamic environments change during decision making.
Example:
Autonomous driving.
Discrete vs Continuous
Discrete environments have a finite number of states.
Example:
Board games.
Continuous environments involve infinitely many states.
Example:
Robot arm movement.
Known vs Unknown
In known environments, the agent understands the rules and effects of actions.
In unknown environments, the agent must learn the environment through experience.
1.3 Problem Formulation: States, Actions, Goals, Path Cost, Solution Quality
To solve a problem effectively, an AI system must first formulate the problem clearly.
States
A state describes the configuration of the system at a specific moment.
Example:
In a puzzle game, the arrangement of tiles represents the current state.
Actions
Actions define the possible operations that move the system from one state to another.
Example:
In an 8-puzzle, actions include moving tiles up, down, left, or right.
Goal State
The goal state represents the desired final configuration.
Example:
Correct arrangement of puzzle tiles.
Path Cost
Path cost measures the total cost of reaching a state.
Examples include:
• Distance traveled
• Time taken
• Energy consumption
Solution Quality
The quality of a solution depends on how well it satisfies the problem constraints.
Criteria may include:
• Minimum cost
• Minimum time
• Maximum efficiency
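The formulation elements above (initial state, actions, result, goal test, step cost) can be collected into one small class. The following is a minimal sketch; the class and method names are illustrative and not taken from any particular library:

```python
# A generic problem formulation: concrete problems subclass this and
# fill in their own states, actions, and transition model.
class Problem:
    def __init__(self, initial, goal):
        self.initial = initial          # initial state
        self.goal = goal                # desired goal state

    def actions(self, state):
        """Return the actions applicable in `state`."""
        raise NotImplementedError

    def result(self, state, action):
        """Return the state reached by applying `action` in `state`."""
        raise NotImplementedError

    def goal_test(self, state):
        return state == self.goal

    def step_cost(self, state, action):
        return 1                        # uniform cost by default
```

A search algorithm then only needs this interface; it never has to know whether the states describe puzzle tiles, map locations, or robot configurations.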
1.4 Toy Examples & Canonical Problems
AI research often uses simplified problems to demonstrate search algorithms and reasoning techniques.
These examples help researchers analyze algorithm behavior before applying them to real-world problems.
1.4.1 8-Puzzle / 15-Puzzle
The 8-puzzle consists of a 3×3 grid with numbered tiles and one empty space.
Goal:
Arrange tiles in correct order by sliding them into the empty position.
Example goal state:
1 2 3
4 5 6
7 8 □
The challenge is to determine the sequence of moves required to reach the goal.
The 15-puzzle is a larger version with a 4×4 grid.
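The sliding moves can be sketched as a successor function. Here a state is represented as a 9-tuple in row-major order with 0 standing for the blank; this representation is chosen for illustration only:

```python
# 8-puzzle successor generation: slide a neighboring tile into the blank.
def successors(state):
    """Yield every state reachable from `state` in one move."""
    i = state.index(0)                  # position of the blank
    row, col = divmod(i, 3)
    moves = []
    if row > 0: moves.append(i - 3)     # tile above slides down
    if row < 2: moves.append(i + 3)     # tile below slides up
    if col > 0: moves.append(i - 1)     # tile on the left slides right
    if col < 2: moves.append(i + 1)     # tile on the right slides left
    for j in moves:
        s = list(state)
        s[i], s[j] = s[j], s[i]         # swap blank with the chosen tile
        yield tuple(s)

goal = (1, 2, 3, 4, 5, 6, 7, 8, 0)      # the goal state shown above
```

A corner blank has 2 successors, an edge blank 3, and a center blank 4, which is why the branching factor of the 8-puzzle averages below 3.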
1.4.2 Traveling Salesman Problem (TSP)
The Traveling Salesman Problem involves finding the shortest possible route that visits a set of cities exactly once and returns to the starting city.
Example:
Cities: A, B, C, D
Possible route:
A → B → C → D → A
Objective:
Minimize total travel distance.
TSP is widely used in logistics, delivery optimization, and transportation planning.
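For a handful of cities the optimal tour can be found by brute force, which also shows why TSP explodes for larger inputs (n−1)! orderings must be checked. A sketch using the 4-city example above, with made-up illustrative distances:

```python
from itertools import permutations

# Hypothetical symmetric distances between the four example cities.
dist = {('A', 'B'): 1, ('B', 'C'): 2, ('C', 'D'): 1,
        ('D', 'A'): 2, ('A', 'C'): 3, ('B', 'D'): 3}

def d(x, y):
    return dist.get((x, y)) or dist[(y, x)]

def tsp(cities, start='A'):
    """Try every ordering of the remaining cities; keep the cheapest tour."""
    rest = [c for c in cities if c != start]
    best = min(permutations(rest),
               key=lambda p: sum(d(a, b) for a, b in
                                 zip((start,) + p, p + (start,))))
    return (start,) + best + (start,)
```

With these distances the cheapest tour is A → B → C → D → A with total cost 6, matching the route shown above.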
1.4.3 Route Finding / Navigation
Route finding is a classic AI search problem.
Example:
Finding the shortest path between cities on a map.
Applications include:
• GPS navigation systems
• Internet routing
• Robot path planning
Search algorithms such as A* and Dijkstra's algorithm are commonly used.
1.4.4 Vacuum World, Blocks World, Missionaries & Cannibals
These classic AI problems are widely used in academic research.
Vacuum World
A simple environment where an agent cleans dirty rooms.
Goal: Clean all rooms using minimal actions.
Blocks World
Blocks are stacked on a table, and the agent must rearrange them to achieve a desired configuration.
Missionaries and Cannibals Problem
Three missionaries and three cannibals must cross a river using a boat.
Constraint:
Cannibals cannot outnumber missionaries on either side.
This problem demonstrates constraint satisfaction and search strategies.
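The safety constraint can be written as a small validity check over states. The sketch below assumes a state records how many missionaries and cannibals are on the left bank (with 3 of each in total):

```python
# Missionaries & cannibals: m, c = people on the left bank (0..3 each).
def valid(m, c):
    """True if cannibals outnumber missionaries on neither bank."""
    mr, cr = 3 - m, 3 - c               # people on the right bank
    left_ok = (m == 0) or (m >= c)      # an empty bank is always safe
    right_ok = (mr == 0) or (mr >= cr)
    return 0 <= m <= 3 and 0 <= c <= 3 and left_ok and right_ok
```

A search algorithm simply prunes any successor state for which this check fails.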
1.5 Measuring Performance: Completeness, Optimality, Time & Space Complexity
Search algorithms are evaluated based on several performance criteria.
Completeness
Completeness indicates whether the algorithm guarantees finding a solution if one exists.
Example:
Breadth-first search is complete in finite state spaces.
Optimality
Optimality refers to whether the algorithm finds the best possible solution.
Example:
Uniform-cost search guarantees optimal solutions when costs are non-negative.
Time Complexity
Time complexity measures the number of nodes expanded during search.
High time complexity can make algorithms impractical for large problems.
Space Complexity
Space complexity measures the memory required by the algorithm.
Some algorithms require large memory to store explored states.
Example:
Breadth-first search may require storing many nodes in memory.
1.6 Search vs Planning: Key Differences & Overlaps
Search and planning are closely related concepts in artificial intelligence.
Search
Search focuses on exploring possible states to find a path from the initial state to the goal state.
Example:
Finding shortest path in a map.
Search algorithms include:
• Breadth-First Search
• Depth-First Search
• A* Search
Planning
Planning focuses on determining sequences of actions that achieve complex goals.
Example:
Robot assembling a product in a factory.
Planning systems often use:
• Logical reasoning
• Action preconditions
• Goal decomposition
Differences
Search generally works with explicit state spaces, while planning often uses symbolic representations and logical reasoning.
However, many planning systems rely on search techniques to explore possible action sequences.
Conclusion
Problem solving and search are fundamental concepts in artificial intelligence. By representing problems as states, actions, and goals, AI systems can explore possible solutions and select optimal strategies.
Classic problems such as the 8-puzzle, Traveling Salesman Problem, and route finding provide valuable insights into search algorithm design. Performance metrics such as completeness, optimality, and computational complexity help evaluate the effectiveness of these algorithms.
Understanding these concepts lays the groundwork for more advanced AI topics including heuristic search, planning systems, constraint satisfaction, and intelligent decision-making algorithms.
Chapter 2: Uninformed (Blind) Search Strategies
Uninformed search strategies, also called blind search algorithms, explore a problem’s state space without using any additional domain knowledge. These algorithms only use the information available in the problem definition such as:
• Initial state
• Possible actions
• Transition model
• Goal test
Unlike heuristic search algorithms, uninformed search does not estimate how close a state is to the goal. Instead, it systematically explores the state space until it finds a solution.
These strategies are widely used as foundational methods in Artificial Intelligence and form the basis for more advanced search techniques.
2.1 Tree Search vs Graph Search (Avoiding Cycles)
Search algorithms can represent the search space in two primary ways: tree search and graph search.
Tree Search
In tree search, the algorithm explores states as nodes of a tree structure.
Each node represents a state, and edges represent actions.
Example:
Initial State → Successor States → Further Successors
However, tree search has a major limitation:
The same state may be visited multiple times, leading to redundant computations.
Example:
In route finding, a city may be reached through different paths, causing repeated exploration.
Graph Search
Graph search improves efficiency by maintaining a visited set (also called an explored set).
Steps:
Expand a node
Add it to the explored set
Avoid revisiting already explored states
Advantages:
• Prevents infinite loops
• Reduces redundant computations
• Improves search efficiency
Graph search is commonly used in practical AI systems.
2.2 Breadth-First Search (BFS) – Properties & Implementation
Breadth-First Search explores the search tree level by level, expanding all nodes at the current depth before moving to the next level.
BFS uses a queue (FIFO – First In First Out) data structure.
BFS Algorithm Steps
Start with the initial node
Add the node to a queue
Remove the front node from the queue
Check if it is the goal
If not, expand its children and add them to the queue
Repeat until goal is found
Example
Consider a simple tree:
          A
        /   \
       B     C
      / \   / \
     D   E F   G
BFS traversal order:
A → B → C → D → E → F → G
BFS Properties
Completeness: Yes (if branching factor is finite)
Optimality: Yes when all step costs are equal
Time Complexity:
O(bᵈ)
Space Complexity:
O(bᵈ)
Where:
b = branching factor
d = depth of the solution
BFS Implementation (Python)
from collections import deque

def bfs(graph, start, goal):
    """Return True if goal is reachable from start; graph is an adjacency dict."""
    queue = deque([start])                  # FIFO frontier
    visited = set()                         # explored set (graph search)
    while queue:
        node = queue.popleft()
        if node == goal:
            return True
        if node not in visited:
            visited.add(node)
            queue.extend(graph.get(node, []))   # .get avoids KeyError at leaves
    return False
2.3 Depth-First Search (DFS) – Properties, Stack Overflow Risk
Depth-First Search explores the deepest nodes first before exploring siblings.
DFS uses a stack (LIFO – Last In First Out).
DFS Algorithm Steps
Start with the initial node
Push node onto stack
Pop node from stack
Check if it is the goal
Expand children and push them onto stack
Repeat until solution is found
Example Traversal
Using the same tree:
DFS order:
A → B → D → E → C → F → G
DFS Properties
Completeness: No (may get stuck in infinite paths)
Optimality: No
Time Complexity:
O(bᵐ)
Space Complexity:
O(bm)
Where:
m = maximum depth of search tree
Stack Overflow Risk
DFS may explore very deep branches before finding the goal.
If the tree depth is extremely large, recursion-based implementations may cause stack overflow errors.
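An implementation with an explicit stack avoids the recursion-depth problem entirely. A minimal sketch, following the same adjacency-dict convention as the BFS example above:

```python
# Iterative DFS with an explicit stack (no recursion, no stack overflow).
def dfs(graph, start, goal):
    """Return True if goal is reachable from start; graph is an adjacency dict."""
    stack = [start]                         # LIFO frontier
    visited = set()
    while stack:
        node = stack.pop()
        if node == goal:
            return True
        if node not in visited:
            visited.add(node)
            # Push children in reverse so the leftmost child is expanded first.
            stack.extend(reversed(graph.get(node, [])))
    return False
```

On the example tree above this visits nodes in the order A, B, D, E, C, F, G.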
2.4 Depth-Limited Search & Iterative Deepening Search (IDS)
Depth-Limited Search improves DFS by imposing a maximum depth limit.
Depth-Limited Search
The search stops when the depth limit is reached.
Example:
Limit = 3
Nodes deeper than level 3 will not be explored.
Advantages:
• Prevents infinite paths
• Controls search depth
However, if the solution lies deeper than the limit, the algorithm fails.
Iterative Deepening Search (IDS)
Iterative Deepening combines the advantages of BFS and DFS.
It repeatedly performs depth-limited search with increasing limits.
Search sequence:
Depth limit = 0
Depth limit = 1
Depth limit = 2
Depth limit = 3
Example:
Iteration 1: A
Iteration 2: A → B → C
Iteration 3: A → B → D → E → C → F → G
IDS Properties
Completeness: Yes
Optimality: Yes (for equal step costs)
Time Complexity:
O(bᵈ)
Space Complexity:
O(bd)
IDS is widely used in large search spaces.
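The idea can be sketched as a depth-limited DFS called repeatedly with growing limits. This simple version assumes a tree-shaped space (no cycle checking), as in the example tree above:

```python
# Depth-limited DFS: succeed only if the goal lies within `limit` steps.
def depth_limited(graph, node, goal, limit):
    if node == goal:
        return True
    if limit == 0:
        return False                        # cutoff reached
    return any(depth_limited(graph, child, goal, limit - 1)
               for child in graph.get(node, []))

def ids(graph, start, goal, max_depth=20):
    """Iterative deepening: try limits 0, 1, 2, ... up to max_depth."""
    return any(depth_limited(graph, start, goal, limit)
               for limit in range(max_depth + 1))
```

Although shallow levels are re-expanded on every iteration, the total work is dominated by the deepest level, which is why IDS keeps the O(bᵈ) time bound while using only O(bd) memory.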
2.5 Uniform-Cost Search (UCS / Dijkstra’s in Search Context)
Uniform-Cost Search expands the node with the lowest path cost.
It is equivalent to Dijkstra’s shortest path algorithm.
Instead of exploring nodes level by level, UCS always chooses the node with the minimum cumulative cost.
UCS Evaluation Function
f(n) = g(n)
Where:
g(n) = cost of path from start to node n
Example
Suppose paths have costs:
A → B = 2
A → C = 5
B → D = 4
C → D = 1
UCS chooses nodes with lowest accumulated cost first.
UCS Properties
Completeness: Yes
Optimality: Yes (when costs are non-negative)
Time Complexity:
O(b^(1 + ⌊C*/ε⌋))
Space Complexity: Same as time complexity
Where:
C* = optimal solution cost
ε = minimum step cost
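A minimal UCS sketch uses a priority queue keyed by g(n). Here the graph maps each node to a list of (neighbor, cost) pairs, matching the example costs above:

```python
import heapq

# Uniform-cost search: always expand the node with the lowest path cost g(n).
def ucs(graph, start, goal):
    """Return the optimal path cost from start to goal, or None if unreachable."""
    frontier = [(0, start)]                 # priority queue ordered by g(n)
    best = {start: 0}                       # cheapest known cost to each node
    while frontier:
        g, node = heapq.heappop(frontier)
        if node == goal:
            return g                        # first pop of goal is optimal
        if g > best.get(node, float('inf')):
            continue                        # stale queue entry, skip it
        for nbr, cost in graph.get(node, []):
            g2 = g + cost
            if g2 < best.get(nbr, float('inf')):
                best[nbr] = g2
                heapq.heappush(frontier, (g2, nbr))
    return None
```

With the costs above, both A → B → D and A → C → D cost 6, so UCS returns 6 for the pair (A, D).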
2.6 Bidirectional Search
Bidirectional search runs two simultaneous searches:
• Forward search from initial state
• Backward search from goal state
The search stops when the two frontiers meet.
Example
Initial → A → B → C → Goal
Forward search explores:
Initial → A → B
Backward search explores:
Goal → C → B
The meeting point is node B.
Advantages
• Significantly reduces search depth
• Faster than single-direction search
Time Complexity:
O(b^(d/2))
This is much smaller than BFS complexity O(bᵈ).
Limitations
• Requires ability to generate predecessor states
• Difficult for problems with unknown goal states
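The two-frontier idea can be sketched as a bidirectional BFS for undirected graphs, expanding one level per side until the frontiers meet:

```python
from collections import deque

# Bidirectional BFS sketch: grow forward and backward frontiers in turn.
def bidirectional(graph, start, goal):
    """Return True if start and goal are connected; graph is an adjacency dict."""
    if start == goal:
        return True
    f_seen, b_seen = {start}, {goal}
    f_q, b_q = deque([start]), deque([goal])
    while f_q and b_q:
        for q, seen, other in ((f_q, f_seen, b_seen), (b_q, b_seen, f_seen)):
            for _ in range(len(q)):         # expand exactly one level
                node = q.popleft()
                for nbr in graph.get(node, []):
                    if nbr in other:
                        return True         # frontiers meet
                    if nbr not in seen:
                        seen.add(nbr)
                        q.append(nbr)
    return False
```

Each frontier only has to reach depth d/2, which is where the O(b^(d/2)) bound comes from.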
2.7 Comparison Table: Completeness, Optimality, Time/Space Complexity
Algorithm | Completeness | Optimality | Time Complexity | Space Complexity
BFS | Yes | Yes (equal step costs) | O(bᵈ) | O(bᵈ)
DFS | No | No | O(bᵐ) | O(bm)
Depth-Limited | No | No | O(bˡ) | O(bl)
Iterative Deepening | Yes | Yes (equal step costs) | O(bᵈ) | O(bd)
Uniform-Cost Search | Yes | Yes | O(b^(1 + ⌊C*/ε⌋)) | O(b^(1 + ⌊C*/ε⌋))
Bidirectional | Yes | Yes (with BFS) | O(b^(d/2)) | O(b^(d/2))
Where:
b = branching factor
d = depth of solution
m = maximum depth
l = depth limit
Conclusion
Uninformed search strategies provide fundamental techniques for exploring problem spaces when no additional information about the goal is available. Algorithms such as Breadth-First Search, Depth-First Search, Uniform-Cost Search, and Iterative Deepening Search offer different trade-offs between time efficiency, memory usage, and solution optimality.
Understanding these algorithms is essential for designing intelligent systems capable of solving complex problems. These blind search strategies also serve as the foundation for more advanced methods such as heuristic search, A* algorithms, and planning systems used in modern artificial intelligence applications.
Chapter 3: Informed (Heuristic) Search – Core Concepts
Informed search algorithms improve upon uninformed search by using additional knowledge about the problem domain. This knowledge is typically encoded in the form of a heuristic function, which estimates how close a given state is to the goal state.
Heuristic search strategies are significantly more efficient because they guide the search toward promising regions of the state space rather than exploring blindly. These methods are widely used in applications such as route planning, game playing, robotics navigation, and automated planning systems.
3.1 What is a Heuristic? Admissible, Consistent (Monotonic), Dominance
A heuristic is a function that estimates the cost from a given state to the goal state.
This function helps the algorithm decide which nodes should be explored first.
A heuristic function is typically denoted as:
h(n)
where:
n = current node
h(n) = estimated cost from node n to the goal
Admissible Heuristic
A heuristic is admissible if it never overestimates the true cost of reaching the goal.
Mathematically:
h(n) ≤ h*(n)
Where:
h*(n) = true optimal cost from n to the goal.
Admissible heuristics guarantee that algorithms like A* find optimal solutions.
Consistent (Monotonic) Heuristic
A heuristic is consistent if it satisfies the triangle inequality.
h(n) ≤ c(n, n′) + h(n′)
Where:
c(n,n') = cost from node n to successor n'
Consistency ensures that the total estimated cost f(n) = g(n) + h(n) never decreases along any path.
If a heuristic is consistent, it is automatically admissible.
Dominance
Given two admissible heuristics h₁ and h₂:
If
h₁(n) ≥ h₂(n)
for all nodes n, then h₁ dominates h₂.
A dominating heuristic generally leads to faster search performance because it provides better estimates.
3.2 Heuristic Functions: Design Principles & Examples
Designing effective heuristics is crucial for improving search efficiency.
Good heuristic functions should:
• Be easy to compute
• Closely approximate the true cost
• Never overestimate the goal distance (for admissibility)
3.2.1 Manhattan Distance, Euclidean Distance, Misplaced Tiles
These are commonly used heuristics in search problems.
Manhattan Distance
Used in grid-based navigation and puzzles.
It measures the total horizontal and vertical distance between two positions.
h(n) = |x₁ − x₂| + |y₁ − y₂|
Example:
Moving from (2,3) to (5,6)
Manhattan distance = 3 + 3 = 6
Euclidean Distance
Used when movement is allowed in any direction.
h(n) = √((x₁ − x₂)² + (y₁ − y₂)²)
Example:
Distance between two geographic locations.
Misplaced Tiles
Used in puzzle problems such as the 8-puzzle.
The heuristic counts the number of tiles that are not in their correct positions.
Example:
If 3 tiles are misplaced:
h(n) = 3
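Both tile heuristics can be sketched directly. As before, a state is a 9-tuple in row-major order with 0 as the blank; this representation is chosen for illustration:

```python
# Two admissible 8-puzzle heuristics over 9-tuple states (0 = blank).
GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)

def misplaced_tiles(state):
    """Count the non-blank tiles that are not in their goal position."""
    return sum(1 for s, g in zip(state, GOAL) if s != 0 and s != g)

def manhattan(state):
    """Sum each tile's horizontal + vertical distance from its goal cell."""
    total = 0
    for i, tile in enumerate(state):
        if tile == 0:
            continue
        gi = GOAL.index(tile)           # goal position of this tile
        total += abs(i // 3 - gi // 3) + abs(i % 3 - gi % 3)
    return total
```

Since every misplaced tile is at least one move from its goal cell, manhattan(n) ≥ misplaced_tiles(n) for every state: the Manhattan heuristic dominates the misplaced-tiles heuristic.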
3.2.2 Pattern Databases & Disjoint Pattern Databases
Pattern databases store precomputed optimal costs for subsets of a problem.
Example:
In the 15-puzzle problem, a pattern database may store the optimal number of moves required for certain tile configurations.
Advantages:
• Provides highly accurate heuristic estimates
• Improves search efficiency significantly
Disjoint Pattern Databases
These divide the problem into independent subsets.
Example:
One database for tiles {1,2,3,4}
Another for tiles {5,6,7,8}
The heuristic value is the sum of the independent pattern costs.
This approach yields very strong heuristics.
3.2.3 Domain-Independent Heuristics
Domain-independent heuristics are used in automated planning systems.
Delete Relaxation
Delete effects of actions are ignored.
Example:
If an action removes a condition, the relaxed problem ignores this removal.
h⁺ heuristic
Represents the optimal solution cost in the relaxed planning problem.
h_max heuristic
Estimates cost as the maximum cost among subgoals.
h_add heuristic
Estimates cost as the sum of individual goal costs.
These heuristics are commonly used in automated planning systems like Fast Downward.
3.3 Greedy Best-First Search (Pure Heuristic Guidance)
Greedy Best-First Search selects the node with the lowest heuristic value.
Evaluation function:
f(n) = h(n)
This means the algorithm always chooses the node that appears closest to the goal.
Advantages
• Fast search in many cases
• Focuses directly on promising nodes
Disadvantages
• Not guaranteed to find optimal solutions
• May get stuck in local minima
Example:
A navigation system always choosing roads that appear closest to the destination.
3.4 A* Search – Optimal & Complete with Admissible Heuristics
A* is one of the most important heuristic search algorithms.
It combines:
• Path cost from start node
• Heuristic estimate to goal
Evaluation function:
f(n) = g(n) + h(n)
Where:
g(n) = cost from start to node n
h(n) = estimated cost from n to goal
The algorithm expands the node with the lowest f(n) value.
Advantages of A*
• Complete
• Optimal with admissible heuristic
• Efficient compared to blind search
Applications include:
• GPS navigation systems
• Robot path planning
• Game AI
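A minimal A* sketch over an adjacency dict, assuming non-negative step costs and an admissible heuristic h supplied by the caller (the example graph and heuristic values below are illustrative):

```python
import heapq

# A* search: expand the frontier node with the lowest f(n) = g(n) + h(n).
def a_star(graph, start, goal, h):
    """graph maps node -> [(neighbor, cost)]. Returns (path, cost)."""
    frontier = [(h(start), 0, start, [start])]      # (f, g, node, path)
    best_g = {start: 0}                             # cheapest known g per node
    while frontier:
        f, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, g                          # admissible h => optimal
        for nbr, cost in graph.get(node, []):
            g2 = g + cost
            if g2 < best_g.get(nbr, float('inf')):
                best_g[nbr] = g2
                heapq.heappush(frontier, (g2 + h(nbr), g2, nbr, path + [nbr]))
    return None, float('inf')
```

Setting h to zero everywhere turns this into uniform-cost search, which shows how A* generalizes UCS by adding heuristic guidance.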
3.5 Properties of A*: Optimality Proof, Consistency vs Admissibility
A* is optimal when the heuristic is admissible.
The algorithm guarantees that the first solution found is the lowest-cost solution.
Consistency vs Admissibility
Admissible heuristic:
Never overestimates true cost.
Consistent heuristic:
Satisfies triangle inequality.
Consistency ensures that f-values never decrease along a path, simplifying implementation.
3.6 Weighted A* & Anytime Variants
Weighted A* modifies the evaluation function to speed up search.
f(n) = g(n) + w · h(n)
Where:
w > 1
Advantages:
• Faster search
• Finds near-optimal solutions quickly
Disadvantages:
• Optimality is no longer guaranteed (with an admissible heuristic, the solution cost is at most w times the optimal cost)
Anytime Algorithms
Anytime algorithms quickly produce a solution and then improve it over time.
Example:
Anytime A*
• Finds initial solution quickly
• Refines it gradually
3.7 IDA* (Iterative Deepening A*) – Memory-Efficient Optimal Search
IDA* combines A* with iterative deepening search.
Instead of storing all nodes in memory, IDA* performs depth-first searches with increasing cost limits.
Search limit is based on:
f(n)
Advantages:
• Much lower memory usage
• Maintains optimality
IDA* is widely used in solving puzzles such as the 15-puzzle.
3.8 Real-Time Variants: LRTA*, RTA*, ARA*
Real-time heuristic search algorithms are designed for environments where agents must act quickly.
LRTA* (Learning Real-Time A*)
The agent updates heuristic values during exploration.
This allows learning while navigating the environment.
Example:
Robot exploring an unknown building.
RTA* (Real-Time A*)
RTA* limits search depth to ensure real-time responses.
The agent selects the best action based on limited lookahead.
ARA* (Anytime Repairing A*)
ARA* combines:
• Anytime search
• Weighted A*
It quickly finds a suboptimal solution and improves it progressively.
Applications include:
• Autonomous robotics
• Real-time navigation systems
Conclusion
Heuristic search algorithms dramatically improve problem-solving efficiency in artificial intelligence by guiding exploration toward promising regions of the state space. Concepts such as admissible and consistent heuristics ensure reliable performance and optimal solutions.
Algorithms such as Greedy Best-First Search, A*, IDA*, and weighted A* demonstrate different trade-offs between optimality, memory usage, and computational efficiency. Real-time variants like LRTA* and RTA* enable intelligent agents to operate effectively in dynamic environments.
These heuristic search techniques are fundamental components of modern AI systems used in navigation, robotics, automated planning, and intelligent decision-making applications.
Chapter 4: Local & Metaheuristic Search Techniques
Local search algorithms are designed to find good solutions in very large or continuous search spaces. Unlike systematic search algorithms (such as BFS or A*), local search algorithms focus on iteratively improving a single candidate solution rather than exploring the entire search tree.
These techniques are particularly useful when the search space is extremely large and exact solutions are computationally infeasible. Local and metaheuristic search methods are widely used in optimization problems such as scheduling, routing, resource allocation, and machine learning parameter tuning.
4.1 Hill Climbing & Variants (Simple, Stochastic, First-Choice, Random-Restart)
Hill climbing is one of the simplest local search algorithms. It starts with an initial solution and repeatedly moves to a neighboring solution that improves the evaluation function.
The idea is similar to climbing a hill, where the algorithm always moves in the direction of increasing value.
Basic Hill Climbing
Steps:
Start with an initial state
Evaluate neighboring states
Move to the neighbor with the highest value
Repeat until no better neighbor exists
Example:
Optimizing a mathematical function.
The algorithm moves toward higher function values until it reaches a peak.
Problems with Hill Climbing
Hill climbing may get stuck in:
Local Maximum
A solution better than neighbors but not the best overall.
Plateau
Flat region where many states have the same value.
Ridge
A narrow path where simple moves do not improve the solution.
Variants of Hill Climbing
Simple Hill Climbing
Examines neighbors sequentially and selects the first improvement.
Stochastic Hill Climbing
Randomly selects among improving neighbors.
This reduces the chance of getting stuck in local maxima.
First-Choice Hill Climbing
Generates random neighbors until it finds an improvement.
This approach is efficient when the number of neighbors is very large.
Random-Restart Hill Climbing
The algorithm runs hill climbing multiple times with different initial states.
The best result among the runs is selected.
This technique significantly improves the probability of finding the global optimum.
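The basic loop and the random-restart variant can be sketched on a toy one-peak objective; the function, step size, and restart count below are illustrative only:

```python
import random

# Toy objective with a single global maximum at x = 3.
def objective(x):
    return -(x - 3) ** 2 + 9

def hill_climb(x, step=1):
    """Greedy ascent over the integer neighbors x - step and x + step."""
    while True:
        best = max((x - step, x, x + step), key=objective)
        if best == x:
            return x                    # no improving neighbor: a maximum
        x = best

def random_restart(restarts=10, seed=0):
    """Run hill climbing from several random starts; keep the best result."""
    rng = random.Random(seed)
    return max((hill_climb(rng.randint(-10, 10)) for _ in range(restarts)),
               key=objective)
```

On this concave objective every start climbs to the global maximum; on a multi-peak objective only the restarts would rescue the search from local maxima.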
4.2 Simulated Annealing – Cooling Schedule & Metropolis Criterion
Simulated Annealing is inspired by the physical process of annealing in metallurgy, where materials are heated and slowly cooled to reduce structural defects.
The algorithm allows occasional downhill moves to escape local maxima.
Acceptance Probability
Simulated annealing accepts worse solutions based on the following probability.
P = e^(−ΔE / T)
Where:
ΔE = change in evaluation function
T = temperature parameter
Higher temperatures allow more exploration.
Cooling Schedule
The temperature gradually decreases during the search.
Common cooling strategies:
• Linear cooling
• Exponential cooling
• Logarithmic cooling
As temperature approaches zero, the algorithm behaves like hill climbing.
Advantages
• Escapes local maxima
• Works well for complex optimization problems
Applications include:
• Traveling Salesman Problem
• Circuit design optimization
• Scheduling problems
4.3 Genetic Algorithms & Evolutionary Search
Genetic Algorithms (GAs) are inspired by the process of biological evolution.
Instead of working with a single solution, genetic algorithms maintain a population of candidate solutions.
Each solution is represented as a chromosome.
Main Components of Genetic Algorithms
Selection
Selects individuals based on fitness.
Common methods:
• Roulette wheel selection
• Tournament selection
Higher fitness individuals are more likely to reproduce.
Crossover
Combines two parent solutions to produce offspring.
Example:
Parent 1: 110010
Parent 2: 101111
Offspring: 110111 (one-point crossover after the third gene)
Mutation
Randomly modifies genes to introduce diversity.
Example:
110010 → 110110
Mutation prevents premature convergence.
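The crossover and mutation examples above can be reproduced directly on bit-string chromosomes. This is a minimal sketch; the crossover point and mutated index are chosen to match the examples, and the helper names are illustrative.

```python
def crossover(p1, p2, point):
    # one-point crossover: first `point` genes from parent 1, rest from parent 2
    return p1[:point] + p2[point:]

def mutate(chrom, index):
    # bit-flip mutation: flip a single gene to introduce diversity
    flipped = '1' if chrom[index] == '0' else '0'
    return chrom[:index] + flipped + chrom[index + 1:]

print(crossover("110010", "101111", point=3))  # -> 110111
print(mutate("110010", index=3))               # -> 110110
```

In a full genetic algorithm these operators would be applied to a population each generation, with the crossover point and mutated gene chosen at random.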
Advantages
• Suitable for complex optimization problems
• Explores large search spaces effectively
Applications include:
• Machine learning parameter tuning
• Engineering design optimization
• Scheduling problems
4.4 Tabu Search & Reactive Tabu
Tabu Search is an advanced local search method that avoids revisiting recently explored solutions.
The algorithm maintains a tabu list, which stores recently visited states.
Moves that lead to these states are temporarily forbidden.
Tabu List
The tabu list prevents the algorithm from cycling back to previous solutions.
Example:
If state A was recently visited, the algorithm cannot return to A for a certain number of iterations.
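This mechanism can be sketched with a fixed-length queue as the tabu list: a state stays forbidden for as many iterations as the queue is long. The integer state space, function, and tenure below are illustrative assumptions.

```python
from collections import deque

def tabu_search(f, start, tenure=3, iters=50):
    current = best = start
    tabu = deque(maxlen=tenure)           # recently visited states are forbidden
    for _ in range(iters):
        tabu.append(current)
        neighbors = [n for n in (current - 1, current + 1) if n not in tabu]
        if not neighbors:
            break
        current = max(neighbors, key=f)   # best non-tabu neighbor (may be worse)
        if f(current) > f(best):
            best = current                # track best solution found so far
    return best

print(tabu_search(lambda x: -(x - 5) ** 2, start=0))  # -> 5
```

Note that unlike hill climbing, the move to the best non-tabu neighbor is made even when it worsens the solution; the tabu list is what prevents the search from immediately undoing it.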
Reactive Tabu Search
Reactive tabu search dynamically adjusts the length of the tabu list based on search behavior.
Advantages:
• Prevents cycling
• Encourages exploration of new regions
Applications include:
• Vehicle routing problems
• Job scheduling
• Network optimization
4.5 Beam Search & Variants (Stochastic Beam, Beam Stack)
Beam search is a heuristic search algorithm that explores only the best k nodes at each level.
The parameter k is called the beam width.
Example:
If beam width = 3, only the top three nodes are expanded at each step.
Advantages
• Reduces memory usage
• Faster than exhaustive search
Variants
Stochastic Beam Search
Selects nodes probabilistically rather than deterministically.
This maintains diversity among candidate solutions.
Beam Stack Search
Combines beam search with backtracking to recover from poor choices.
4.6 Local Beam Search & Parallel Variants
Local beam search maintains multiple states simultaneously and allows them to share information.
Steps:
Start with k random states
Generate successors of all states
Select the best k states among successors
Repeat until solution is found
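The steps above can be sketched on a toy one-dimensional problem, where successors are ±1 steps and the shared information is simply the pooled successor set from which the best k survive. All parameters here are illustrative.

```python
import random

def local_beam_search(f, k=3, steps=30, seed=0):
    rng = random.Random(seed)
    states = [rng.randint(-20, 20) for _ in range(k)]  # k random initial states
    for _ in range(steps):
        # generate successors of all states, pooled together
        successors = {s + d for s in states for d in (-1, 1)}
        # select the best k states among all successors
        states = sorted(successors, key=f, reverse=True)[:k]
    return max(states, key=f)

print(local_beam_search(lambda x: -(x - 4) ** 2))
```

The pooling step is what distinguishes this from k independent hill climbers: a state with poor successors contributes nothing to the next beam, so effort concentrates on the most promising regions.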
Advantages
• Combines exploration and exploitation
• Often faster than independent hill climbing
Parallel Variants
Parallel implementations distribute search across multiple processors.
Benefits include:
• Faster computation
• Better exploration of search space
Parallel beam search is commonly used in natural language processing and AI planning systems.
4.7 When to Use Local Search
Local search algorithms are particularly useful when:
• The search space is extremely large
• Exact solutions are not required
• Approximate solutions are acceptable
• The problem is continuous or high-dimensional
Examples include:
• Traveling Salesman Problem
• Neural network weight optimization
• Scheduling and resource allocation
Local search methods often produce good solutions quickly, even if they do not guarantee optimality.
Conclusion
Local and metaheuristic search techniques provide powerful tools for solving complex optimization problems where traditional search methods are impractical. Algorithms such as hill climbing, simulated annealing, genetic algorithms, tabu search, and beam search offer different strategies for exploring large search spaces efficiently.
While these methods do not always guarantee optimal solutions, they are highly effective in finding near-optimal solutions within reasonable computational time. As a result, they are widely applied in fields such as operations research, machine learning, robotics, and engineering optimization.
Chapter 5: Adversarial Search & Game Playing
Adversarial search is used in Artificial Intelligence for problems involving multiple competing agents, typically in games where one player’s gain is another player’s loss. These problems are often modeled as zero-sum games, where the total payoff for all players sums to zero.
In such environments, AI systems must consider not only their own moves but also the possible responses of opponents. Adversarial search algorithms evaluate game states and predict optimal strategies for both players.
Applications of adversarial search include:
• Chess and board games
• Video game AI
• Strategic planning systems
• Military simulations
5.1 Minimax Algorithm – Perfect Information, Zero-Sum Games
The Minimax algorithm is the fundamental method used for adversarial search in two-player games.
It assumes:
• Both players play optimally
• One player tries to maximize the score
• The other player tries to minimize the score
Game tree representation:
MAX player → tries to maximize score
MIN player → tries to minimize score
The algorithm explores all possible game states and selects the move that leads to the best worst-case outcome.
Minimax Value Function
V(s) = \begin{cases} \max_{a} V(Result(s,a)) & \text{if MAX turn} \\ \min_{a} V(Result(s,a)) & \text{if MIN turn} \end{cases}
Where:
s = current state
a = possible action
Example
Consider a simplified game tree:
          MAX
         /    \
      MIN      MIN
     /   \    /   \
    3     5  2     9
Evaluation:
MIN nodes choose minimum values.
Left branch → min(3,5) = 3
Right branch → min(2,9) = 2
MAX node chooses maximum:
max(3,2) = 3
Optimal move → left branch.
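The evaluation above can be verified with a short recursive implementation. In this minimal sketch, a tree is either an integer leaf (its evaluation) or a list of child subtrees; the representation is an illustrative choice, not a standard one.

```python
def minimax(node, is_max):
    if isinstance(node, int):          # leaf: return its static evaluation
        return node
    values = [minimax(child, not is_max) for child in node]
    return max(values) if is_max else min(values)

# the example tree: a MAX root with two MIN children
tree = [[3, 5], [2, 9]]
print(minimax(tree, is_max=True))      # -> 3
```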
Limitations
Minimax suffers from exponential growth of game trees.
Time complexity:
O(bᵈ)
Where:
b = branching factor
d = search depth
5.2 Alpha-Beta Pruning – Optimizations & Effectiveness
Alpha-Beta pruning improves the efficiency of the minimax algorithm by eliminating branches that cannot affect the final decision.
Two parameters are maintained:
α (alpha): best value for MAX
β (beta): best value for MIN
Pruning Condition
If:
α ≥ β
then the remaining branches can be pruned.
Advantages
• Reduces number of nodes explored
• Produces same optimal result as minimax
• Allows deeper search
Best-case complexity:
O(b^(d/2))
This effectively doubles the search depth compared to plain minimax.
5.3 Iterative Deepening, Move Ordering, Transposition Tables
Game-playing AI often combines multiple techniques to improve performance.
Iterative Deepening
Instead of searching directly to a deep level, the algorithm performs repeated searches with increasing depth.
Example:
Depth 1 → Depth 2 → Depth 3 → Depth 4
Benefits:
• Provides anytime results
• Improves move ordering
• Works well with time constraints
Move Ordering
Alpha-beta pruning works best when good moves are evaluated first.
Techniques include:
• Killer heuristic
• History heuristic
• Prior evaluation knowledge
Good move ordering significantly increases pruning efficiency.
Transposition Tables
Many game positions can be reached through different move sequences.
Transposition tables store previously evaluated states to avoid redundant computations.
Example:
Hash table storing:
State → Evaluation value
This technique is widely used in modern chess engines.
5.4 Heuristic Evaluation Functions in Games
When the game tree is too large to search completely, the algorithm stops at a limited depth and evaluates positions using heuristic evaluation functions.
These functions estimate the desirability of a game state.
Example: Chess Evaluation
Typical evaluation features include:
• Material balance
• Piece mobility
• King safety
• Pawn structure
• Board control
Example evaluation formula:
Evaluation = Material + Position + Mobility
Higher scores favor the MAX player.
5.5 Expectiminimax – Games with Chance
Some games involve random events or probabilistic outcomes.
Examples include:
• Backgammon
• Dice games
• Card games
In these cases, the game tree contains chance nodes.
Expectiminimax extends minimax to handle probabilistic outcomes.
Expectiminimax Value
V(s) = \sum_{i} P(i) \cdot V(Result_i(s))
Where:
P(i) = probability of outcome i
The algorithm computes the expected value of outcomes.
5.6 Monte Carlo Tree Search (MCTS) – UCT Formula
Monte Carlo Tree Search is a modern game-playing algorithm that combines random simulation with tree search.
MCTS operates in four steps:
Selection
Expansion
Simulation
Backpropagation
UCT (Upper Confidence Bound for Trees)
Node selection is guided by the UCT formula.
UCT = \frac{w_i}{n_i} + c \sqrt{\frac{\ln N}{n_i}}
Where:
wᵢ = number of wins for node
nᵢ = visits of node
N = total simulations
c = exploration constant
The formula balances:
• Exploration (trying new moves)
• Exploitation (choosing best-known moves)
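The UCT rule itself is a one-liner; the sketch below applies it to select among children given their (wᵢ, nᵢ) statistics. The statistics and the value of c are illustrative assumptions.

```python
import math

def uct_score(wins, visits, total, c=1.41):
    # exploitation term + exploration term
    return wins / visits + c * math.sqrt(math.log(total) / visits)

def select_child(children, total, c=1.41):
    # children: list of (wins, visits) pairs, one per child node
    return max(range(len(children)),
               key=lambda i: uct_score(*children[i], total, c))

stats = [(10, 20), (3, 4), (0, 1)]   # illustrative (w_i, n_i), N = 25
print(select_child(stats, total=25))  # -> 2: the barely-visited child wins
```

The rarely visited third child has the worst win rate but the largest exploration bonus, so UCT selects it; this is exactly the exploration/exploitation balance the formula encodes. (Real MCTS implementations also guard against nᵢ = 0, typically by selecting unvisited children first.)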
5.7 Applications: Chess, Go, and Real-Time Strategy Games
Adversarial search techniques power many modern AI game systems.
Chess Engines
Modern chess engines use:
• Alpha-beta pruning
• Transposition tables
• Heuristic evaluation functions
Examples include:
Stockfish and Deep Blue.
Go (AlphaGo Style)
The game of Go has an enormous search space.
AlphaGo combines:
• Deep neural networks
• Monte Carlo Tree Search
• Reinforcement learning
This system defeated world champion Go players.
Real-Time Strategy Games
Games like StarCraft require:
• Resource management
• Long-term planning
• Multi-agent coordination
AI systems combine:
• Reinforcement learning
• Monte Carlo search
• Planning algorithms
Conclusion
Adversarial search plays a crucial role in AI systems designed for competitive environments. Algorithms such as minimax and alpha-beta pruning provide systematic methods for evaluating game trees and selecting optimal strategies.
Advanced techniques such as iterative deepening, heuristic evaluation functions, and transposition tables significantly improve performance in complex games. Monte Carlo Tree Search represents a modern approach that enables AI systems to handle extremely large decision spaces.
These algorithms have enabled AI systems to achieve superhuman performance in games such as chess and Go, and they continue to influence modern AI research in fields such as strategic planning, robotics, and decision-making systems.
Chapter 6: Classical Planning – Foundations
Classical planning is a major area of Artificial Intelligence concerned with generating sequences of actions that achieve specific goals. While search algorithms explore state spaces to find solutions, planning focuses on representing actions, goals, and environmental changes in a structured manner.
Planning techniques are widely used in:
• Robotics
• Automated scheduling
• Logistics and transportation systems
• Game AI
• Autonomous agents
In classical planning, the environment is typically assumed to be fully observable, deterministic, static, and discrete.
6.1 Planning vs Search: Representation Power
Although planning and search are closely related, they differ in how problems are represented and solved.
Search algorithms operate on explicit state spaces, where each state represents a possible configuration of the system.
Example:
A route-finding problem where nodes represent cities.
Planning, however, focuses on actions and their effects.
Example:
Robot actions such as:
• Pick object
• Move object
• Place object
Planning systems reason about preconditions and effects of actions rather than simply exploring states.
Comparison
Aspect         | Search                 | Planning
Representation | States and transitions | Actions, preconditions, effects
Reasoning      | Path finding           | Action sequencing
Complexity     | Explicit search space  | Symbolic reasoning
Planning often incorporates search techniques internally.
6.2 STRIPS Language: Actions, Preconditions, Add/Delete Lists
STRIPS (Stanford Research Institute Problem Solver) is one of the earliest planning languages used to represent planning problems.
A STRIPS action is defined by three components:
• Preconditions
• Add list
• Delete list
Action Representation
Action: Move(A,B)
Preconditions:
Robot at location A
Path exists from A to B
Add list:
Robot at location B
Delete list:
Robot at location A
When the action is executed:
• Conditions in the add list become true
• Conditions in the delete list become false
Example
Robot moving from room A to room B.
Initial state:
Robot(A)
Goal:
Robot(B)
Action:
Move(A,B)
After execution:
Robot(B) becomes true.
STRIPS simplifies planning by representing state changes through logical predicates.
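The Move(A,B) example can be sketched as set operations over logical predicates: a state is a set of true predicates, and applying an action removes the delete list and adds the add list. The string-based predicate encoding below is an illustrative assumption.

```python
def applicable(state, preconditions):
    # an action is applicable when all its preconditions hold in the state
    return preconditions <= state

def apply_action(state, preconditions, add_list, delete_list):
    if not applicable(state, preconditions):
        raise ValueError("preconditions not satisfied")
    # delete-list predicates become false, add-list predicates become true
    return (state - delete_list) | add_list

state = {"Robot(A)", "Path(A,B)"}
new_state = apply_action(
    state,
    preconditions={"Robot(A)", "Path(A,B)"},
    add_list={"Robot(B)"},
    delete_list={"Robot(A)"},
)
print(sorted(new_state))  # -> ['Path(A,B)', 'Robot(B)']
```

After execution Robot(B) is true and Robot(A) is false, matching the action description above.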
6.3 PDDL (Planning Domain Definition Language)
PDDL is a standardized language used to define planning problems.
It was introduced in the International Planning Competition (IPC) to allow planners to work with common benchmarks.
A PDDL problem consists of two main parts:
• Domain file
• Problem file
Domain File
Defines:
• Actions
• Predicates
• Types
Example structure:
(:action move
  :parameters (?x ?y)
  :precondition (at ?x)
  :effect (and (at ?y) (not (at ?x))))
Problem File
Defines:
• Initial state
• Goal conditions
• Objects
Example:
(:init (at A))
(:goal (at B))
PDDL Versions
Different versions extend planning capabilities.
PDDL 1.2 – classical planning
PDDL 2.1 – temporal planning
PDDL 2.2 – derived predicates
PDDL 3.x – preferences and constraints
PDDL is widely used in automated planning systems and AI research competitions.
6.4 Planning as State-Space Search
In state-space planning, the planner searches through possible states generated by applying actions.
Two approaches are commonly used.
Forward State-Space Search
Also called progression planning.
Steps:
Start from initial state
Apply applicable actions
Generate successor states
Continue until goal is reached
This approach resembles traditional search algorithms.
Backward State-Space Search
Also called regression planning.
Steps:
Start from goal state
Identify actions that achieve the goal
Determine required preconditions
Continue until initial state is satisfied
Backward search often reduces the number of explored states.
6.5 Planning as Plan-Space Search (Partial-Order Planning – POP)
Plan-space search focuses on searching through possible plans instead of states.
Partial-order planning allows actions to remain partially ordered rather than completely sequential.
Example plan:
Pick up block
Move robot
Place block
Some actions may be executed in any order if they do not interfere.
Advantages of POP
• Flexible action ordering
• Parallel execution of actions
• Reduced search complexity
POP is useful in robotic task planning and workflow management systems.
6.6 Causal Links, Threats & Promotion/Demotion
In partial-order planning, relationships between actions are represented using causal links.
Causal Link
A causal link indicates that one action provides a condition required by another action.
Example:
Action A → produces condition P
Action B → requires condition P
This relationship is represented as:
A —P→ B
Threats
A threat occurs when an action may invalidate a causal link.
Example:
Action C deletes condition P between A and B.
This creates a conflict.
Threat Resolution
Two methods are used to resolve threats.
Promotion
Move the threatening action after the dependent action.
Demotion
Move the threatening action before the causal link.
These techniques maintain plan consistency.
6.7 Planning Graph – Mutexes, Level-Off, Graphplan Algorithm
Planning graphs provide a structured way to represent planning problems.
They consist of alternating layers of:
• State levels
• Action levels
Mutex (Mutual Exclusion)
Two actions are mutually exclusive if they cannot occur simultaneously.
Examples:
• Actions require conflicting resources
• One action deletes another’s precondition
Mutex relationships help eliminate impossible action combinations.
Level-Off
As the planning graph expands, eventually no new states or actions appear.
This point is called the level-off point.
Beyond this point, the graph structure remains unchanged.
Graphplan Algorithm
Graphplan uses the planning graph to efficiently find plans.
Steps:
Construct planning graph
Identify goal conditions at final level
Perform backward search through graph
Extract valid plan
Advantages:
• Efficient pruning using mutex relations
• Faster than many classical planners
Graphplan was a major breakthrough in automated planning.
Conclusion
Classical planning provides powerful frameworks for generating action sequences that achieve specific goals. Languages such as STRIPS and PDDL enable structured representation of planning problems, while techniques like state-space search and partial-order planning allow efficient plan generation.
Advanced tools such as planning graphs and the Graphplan algorithm improve planning efficiency by identifying mutually exclusive actions and pruning unnecessary search paths. These foundations form the basis for modern planning systems used in robotics, logistics, intelligent agents, and automated decision-making systems.
Chapter 7: Heuristic Search in Planning
Modern automated planning systems rely heavily on heuristic search techniques to efficiently explore enormous planning spaces. Instead of examining every possible sequence of actions, heuristic planners use heuristic estimates to guide search toward promising states.
This paradigm significantly improved the scalability of planning systems and led to the development of powerful planners such as HSP, Fast-Forward (FF), and LAMA, which can solve complex real-world planning problems.
7.1 Planning as Heuristic Search Paradigm (HSP, FF)
Heuristic planning treats planning as a search problem guided by heuristic evaluation functions.
In this approach:
• States represent world configurations
• Actions represent transitions between states
• A heuristic estimates the distance from a state to the goal
The evaluation function is typically written as:
f(n) = g(n) + h(n)
Where:
g(n) = cost of reaching node n
h(n) = estimated cost to goal
HSP (Heuristic Search Planner)
HSP was one of the first planners to use heuristic estimates derived from relaxed planning problems.
Key ideas:
• Ignore delete effects of actions
• Estimate goal distance from simplified problem
This makes heuristic computation easier while still providing useful guidance.
Fast-Forward (FF) Planner
The FF planner improved heuristic planning by extracting relaxed plans from planning graphs.
Advantages:
• Faster heuristic computation
• Improved guidance toward goals
FF became one of the most influential heuristic planning systems.
7.2 Delete Relaxation Heuristics
Delete relaxation simplifies planning problems by ignoring delete effects of actions.
In relaxed problems:
• Actions only add facts
• Facts never become false
This simplification allows efficient heuristic estimation.
h⁺ Heuristic
The h⁺ heuristic represents the optimal solution cost in the relaxed planning problem.
Although highly informative, computing h⁺ exactly is computationally expensive.
h_max Heuristic
The h_max heuristic estimates cost as the maximum cost of achieving individual goals.
h_{max}(s) = \max_{g \in Goals} cost(g)
Advantages:
• Fast computation
• Admissible heuristic
However, it often underestimates total cost.
h_add Heuristic
The h_add heuristic sums the costs of achieving individual goals.
h_{add}(s) = \sum_{g \in Goals} cost(g)
Advantages:
• Provides more informative estimates than h_max
Disadvantage:
• May overestimate cost (not admissible).
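The two estimates can be contrasted directly on per-goal costs. In this minimal sketch, `costs` maps each goal fact to its estimated achievement cost; the facts and numbers are illustrative, as in practice they would come from a relaxed planning problem.

```python
def h_max(costs):
    # cost of the hardest individual goal: admissible, often underestimates
    return max(costs.values())

def h_add(costs):
    # sum of individual goal costs: more informative, may overestimate
    return sum(costs.values())

costs = {"at(B)": 3, "loaded(p1)": 2, "fueled": 4}
print(h_max(costs))  # -> 4
print(h_add(costs))  # -> 9
```

h_add overestimates whenever one action contributes to several goals at once, which is why it is informative but not admissible.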
h_ff Heuristic
The h_ff heuristic is derived from the relaxed plan extracted by the Fast-Forward planner.
It estimates cost by counting the number of actions in the relaxed plan.
Advantages:
• Highly informative
• Efficient computation
7.3 Pattern Databases in Planning
Pattern databases store precomputed optimal costs for subproblems.
These databases are created by solving simplified versions of the planning problem offline.
Example:
In a logistics planning problem, a pattern database may store optimal costs for moving certain packages between locations.
Advantages:
• Strong heuristic estimates
• Significant reduction in search space
Pattern databases are widely used in puzzle solving and planning systems.
7.4 Landmark-Based Heuristics
Landmarks are facts or actions that must occur in every valid plan.
Example:
In a logistics domain:
If a package must reach city B, it must first be loaded onto a vehicle.
Thus:
Load(package, truck) is a landmark.
Landmark Graph
Landmarks can be organized in a graph representing ordering constraints.
Example:
Landmark 1 → Landmark 2 → Landmark 3
This structure helps guide the planner toward necessary steps.
Advantages
• Provides strong guidance
• Reduces unnecessary search
Landmark heuristics are used in advanced planners such as LAMA.
7.5 Fast-Forward (FF) Heuristic & Enforced Hill-Climbing
The Fast-Forward planner introduced an efficient heuristic search strategy called Enforced Hill-Climbing (EHC).
Enforced Hill-Climbing
Steps:
Evaluate heuristic value of current state
Expand neighbors
If better state found, move to it
If no improvement found, perform BFS until improvement appears
This hybrid approach combines:
• Local search (hill climbing)
• Systematic search (BFS)
Advantages:
• Very fast in many planning domains
• Avoids many local minima
7.6 LAMA Planner Family & Anytime Behavior
The LAMA planner is one of the most successful planning systems from the International Planning Competition.
Key features:
• Landmark-based heuristics
• Cost-sensitive search
• Anytime planning capability
Anytime Planning
Anytime planners produce an initial solution quickly and then improve it gradually.
Example:
Iteration 1 → suboptimal plan
Iteration 2 → improved plan
Iteration 3 → near-optimal plan
Advantages:
• Useful in time-critical environments
• Provides progressively better solutions
LAMA uses weighted A* search to achieve this behavior.
7.7 Modern Heuristic Planners: K* / Anytime A* in Planning
Modern planning systems combine heuristic search with anytime optimization techniques.
Anytime A*
Anytime A* starts with a weighted heuristic to find a solution quickly.
f(n) = g(n) + w \cdot h(n)
Where:
w > 1
After finding an initial solution, the algorithm reduces the weight and searches for better plans.
K* Algorithm
K* is a modern planning algorithm that focuses on:
• Efficient heuristic search
• Reduced memory usage
• Improved solution quality
It is particularly useful in large planning problems with complex constraints.
Conclusion
Heuristic search has transformed automated planning by enabling planners to solve problems with extremely large state spaces. Techniques such as delete relaxation heuristics, landmark heuristics, and pattern databases provide powerful estimates that guide search efficiently.
Planners like HSP, FF, and LAMA demonstrate how combining heuristic evaluation with advanced search strategies can produce high-quality plans quickly. Modern algorithms such as Anytime A* and K* continue to push the boundaries of planning efficiency.
These heuristic planning methods are widely used in robotics, logistics, workflow management, and autonomous systems, where efficient decision-making and plan generation are essential.
Chapter 8: Advanced & Hierarchical Planning Techniques
As planning problems grow in complexity, classical planning approaches may struggle due to the enormous size of the search space. Advanced planning techniques address this challenge by introducing hierarchical structures, temporal reasoning, probabilistic models, and multi-agent coordination.
These techniques allow AI systems to handle complex real-world planning problems such as robotics operations, logistics planning, autonomous vehicles, and distributed decision-making systems.
8.1 Hierarchical Task Network (HTN) Planning
Hierarchical Task Network (HTN) planning decomposes complex tasks into smaller, manageable subtasks using a hierarchical structure.
Instead of searching directly in the state space, HTN planners work with tasks and methods.
Key components:
• Tasks – activities to be accomplished
• Methods – rules for decomposing tasks into subtasks
• Operators – primitive actions executable in the environment
Example
High-level task:
Prepare Dinner
Decomposition:
Prepare Dinner
→ Cook Food
→ Set Table
→ Serve Meal
Each subtask can be further decomposed into smaller actions.
Advantages:
• Efficient search due to hierarchical structure
• Incorporates domain knowledge
• Suitable for complex real-world domains
HTN planning is widely used in robotics and workflow automation.
8.2 SHOP & SHOP2 – Domain Knowledge Encoding
SHOP (Simple Hierarchical Ordered Planner) is an HTN planner designed to generate plans in the same order they will be executed.
Unlike classical planners, SHOP uses forward decomposition, ensuring that the current state is always known.
Features of SHOP
• Ordered task decomposition
• Strong use of domain knowledge
• Deterministic planning
SHOP2
SHOP2 extends SHOP by supporting:
• Partial ordering of tasks
• Temporal planning
• Numeric reasoning
Example:
In a logistics domain, domain knowledge may specify:
Load package before transport
Transport before delivery
This guidance significantly reduces search complexity.
8.3 Angelic & Over-Approximation Planning
Angelic planning introduces abstract actions that represent sets of possible concrete actions.
Instead of evaluating every possible action sequence, the planner evaluates optimistic approximations.
Example:
Action:
Deliver Package
Possible implementations:
• Truck delivery
• Drone delivery
• Courier service
Angelic planning evaluates the best possible outcome of each abstraction.
Advantages:
• Reduces search complexity
• Enables high-level planning before detailed decisions
Over-approximation planning similarly simplifies planning problems by relaxing constraints and refining solutions later.
8.4 Temporal & Numeric Planning (PDDL 2.1+)
Real-world planning problems often involve time constraints and numeric resources.
Temporal planning extends classical planning to support durative actions.
Example:
Action:
Drive(A,B)
Duration:
30 minutes
Temporal Planning Concepts
• Action duration
• Temporal constraints
• Concurrent actions
Example:
A robot can charge its battery while monitoring sensors.
Numeric Planning
Numeric planning allows planners to reason about quantitative variables.
Examples:
• Fuel level
• Battery charge
• Resource consumption
Example PDDL expression:
(:effect (decrease (fuel-level) 10))
Temporal and numeric planning capabilities were introduced in PDDL 2.1 and later versions.
These features enable planning systems to handle realistic operational environments.
8.5 Probabilistic Planning (PPDDL, IPPC Benchmarks)
Many real-world environments contain uncertainty.
Probabilistic planning extends classical planning by modeling uncertain action outcomes.
Example:
Robot navigation:
Move Forward
Possible outcomes:
• Successful movement (90%)
• Slip or deviation (10%)
PPDDL (Probabilistic PDDL)
PPDDL extends PDDL to represent probabilistic effects.
Example:
(:effect
  (probabilistic
    0.9 (at B)
    0.1 (at A)))
IPPC Benchmarks
The International Probabilistic Planning Competition (IPPC) provides benchmark problems for evaluating probabilistic planners.
Domains include:
• Robot navigation
• Logistics planning
• Traffic management
Probabilistic planners often use techniques from Markov Decision Processes (MDPs).
8.6 Continual / Online Planning & Replanning
Traditional planning assumes that the environment remains static during plan execution.
However, real-world environments are often dynamic.
Continual planning enables systems to:
• Monitor environment changes
• Adapt plans dynamically
• Replan when necessary
Example
Autonomous vehicle navigation.
If a road becomes blocked, the vehicle must recompute a new route.
Replanning Techniques
Common strategies include:
• Plan repair
• Incremental planning
• Rolling horizon planning
Continual planning ensures that agents remain adaptive and responsive in dynamic environments.
8.7 Multi-Agent Planning & Coordination
Many AI systems involve multiple agents working together or competing.
Multi-agent planning addresses coordination among agents.
Example:
Warehouse robots collaborating to transport packages.
Challenges
• Coordination among agents
• Conflict resolution
• Resource allocation
Coordination Mechanisms
Common approaches include:
• Centralized planning
• Distributed planning
• Negotiation protocols
Example:
Multiple drones coordinating to cover different surveillance areas.
Applications
Multi-agent planning is widely used in:
• Autonomous traffic systems
• Smart manufacturing
• Distributed robotics
• Multi-robot exploration
Conclusion
Advanced planning techniques significantly expand the capabilities of AI planning systems. Hierarchical planning methods such as HTN allow planners to decompose complex tasks into manageable subtasks, while domain-specific planners like SHOP and SHOP2 leverage expert knowledge to improve efficiency.
Temporal, numeric, and probabilistic planning enable planners to operate in realistic environments with time constraints and uncertainty. Continual planning ensures adaptability in dynamic environments, and multi-agent planning supports collaboration among intelligent agents.
Together, these advanced planning techniques form the foundation for modern AI systems capable of solving complex, real-world decision-making and coordination problems across domains such as robotics, logistics, and autonomous systems.
Chapter 9: Constraint Satisfaction Problems (CSP) & Search
Constraint Satisfaction Problems (CSPs) form an important class of problems in Artificial Intelligence where the objective is to find values for a set of variables that satisfy a collection of constraints. CSP frameworks are widely used in scheduling, planning, resource allocation, circuit design, and puzzle solving.
Unlike standard search problems that focus on finding a path from an initial state to a goal state, CSPs focus on assigning values to variables while respecting constraints.
Typical examples of CSP problems include:
• Sudoku
• Map coloring
• Timetable scheduling
• Resource allocation problems
9.1 CSP Formulation: Variables, Domains, Constraints
A Constraint Satisfaction Problem is defined by three main components:
• Variables
• Domains
• Constraints
Variables
Variables represent the unknown values that must be determined.
Example:
In a map-coloring problem:
Variables = {WA, NT, SA, Q, NSW, V, T}
Each variable represents a region of the map.
Domains
The domain of a variable is the set of possible values it can take.
Example:
Domain = {Red, Green, Blue}
Each region can be assigned one of these colors.
Constraints
Constraints restrict the combinations of values that variables can take.
Example constraint:
Adjacent regions cannot have the same color.
Example:
WA ≠ NT
NT ≠ SA
Constraints can be classified as:
• Unary constraints (single variable)
• Binary constraints (two variables)
• Higher-order constraints (multiple variables)
9.2 Backtracking Search & Constraint Propagation
Backtracking search is the fundamental algorithm used to solve CSPs.
It builds solutions incrementally and backtracks when a constraint is violated.
Backtracking Algorithm
Steps:
Select an unassigned variable
Assign a value from its domain
Check whether constraints are satisfied
Continue recursively
If a conflict occurs, backtrack
Example:
Assign colors to regions one by one.
If assigning a color violates a constraint, the algorithm tries another color.
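The procedure can be sketched on the map-coloring example above (Australian regions, three colors). This minimal backtracking sketch assigns variables in a fixed order and checks each assignment against already-colored neighbors; the adjacency data reflects the standard Australia map.

```python
NEIGHBORS = {
    "WA": ["NT", "SA"], "NT": ["WA", "SA", "Q"],
    "SA": ["WA", "NT", "Q", "NSW", "V"], "Q": ["NT", "SA", "NSW"],
    "NSW": ["Q", "SA", "V"], "V": ["SA", "NSW"], "T": [],
}
COLORS = ["Red", "Green", "Blue"]

def backtrack(assignment):
    if len(assignment) == len(NEIGHBORS):
        return assignment                      # every variable assigned
    var = next(v for v in NEIGHBORS if v not in assignment)
    for color in COLORS:
        # constraint check: no already-assigned neighbor has this color
        if all(assignment.get(n) != color for n in NEIGHBORS[var]):
            assignment[var] = color
            result = backtrack(assignment)
            if result is not None:
                return result
            del assignment[var]                # conflict downstream: backtrack
    return None                                # no value works: fail upward

solution = backtrack({})
print(solution["WA"] != solution["NT"])  # adjacent regions differ -> True
```

A real solver would add the ordering heuristics and propagation described in the following sections; this bare version already solves the map because three colors suffice.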
Constraint Propagation
Constraint propagation reduces the search space by eliminating inconsistent values from domains.
Example:
If WA is assigned Red, then Red can be removed from NT's domain.
Advantages:
• Reduces branching factor
• Improves efficiency of search
Constraint propagation is often combined with backtracking.
9.3 Arc Consistency (AC-3), Forward Checking, Maintaining Arc Consistency (MAC)
Constraint propagation techniques enforce consistency between variables.
Arc Consistency
An arc between two variables is consistent if every value in one domain has a supporting value in the other domain.
Example:
Variables:
X and Y
Domains:
X = {1,2,3}
Y = {2,3}
Constraint:
X < Y
Value 3 in X has no supporting value in Y (because no value in Y is greater than 3).
Therefore:
3 is removed from X's domain.
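This domain-pruning step can be written as a small `revise` function (an illustrative name, not a library call) applied to the X &lt; Y example:

```python
# The single-arc "revise" step from the text: delete values of X that
# have no supporting value in Y under the constraint X < Y.

def revise(dx, dy, constraint):
    """Keep only the values in dx that some value in dy supports."""
    return {x for x in dx if any(constraint(x, y) for y in dy)}

X = {1, 2, 3}
Y = {2, 3}
X = revise(X, Y, lambda x, y: x < y)
print(sorted(X))  # 3 is removed: no y in Y satisfies 3 < y
```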
AC-3 Algorithm
AC-3 enforces arc consistency for all variable pairs.
Steps:
Place all arcs in a queue
Remove an arc from queue
Check consistency
Remove inconsistent values
Add affected arcs back to queue
AC-3 is widely used because it is simple and effective.
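A compact sketch of AC-3 along the lines of the steps above, assuming binary constraints stored as allowed-pair predicates keyed by arc (the data layout is illustrative):

```python
from collections import deque

def ac3(domains, constraints):
    """Enforce arc consistency; constraints[(xi, xj)](v, w) is True when allowed."""
    queue = deque(constraints)                 # start with every arc in the queue
    while queue:
        xi, xj = queue.popleft()               # take one arc
        allowed = constraints[(xi, xj)]
        revised = {v for v in domains[xi]      # keep only supported values
                   if any(allowed(v, w) for w in domains[xj])}
        if revised != domains[xi]:
            domains[xi] = revised
            if not revised:
                return False                   # a domain emptied: inconsistent
            for (a, b) in constraints:         # re-examine arcs pointing at xi
                if b == xi and (a, b) not in queue:
                    queue.append((a, b))
    return True

domains = {"X": {1, 2, 3}, "Y": {2, 3}}
constraints = {("X", "Y"): lambda x, y: x < y,
               ("Y", "X"): lambda y, x: y > x}
ac3(domains, constraints)
print(domains)  # X loses 3, as in the worked example
```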
Forward Checking
Forward checking prevents future conflicts during search.
After assigning a variable, the algorithm removes inconsistent values from neighboring domains.
Example:
If WA = Red, remove Red from NT and SA domains.
Forward checking detects conflicts earlier than plain backtracking.
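A sketch of this pruning step for the map example; the function and variable names are illustrative:

```python
# Forward checking: after WA = Red, prune Red from WA's neighbors.

def forward_check(var, value, domains, neighbors):
    """Prune `value` from each neighbor's domain; return the pruned
    copy, or None if some domain becomes empty (early conflict)."""
    new_domains = {v: set(d) for v, d in domains.items()}
    for n in neighbors[var]:
        new_domains[n].discard(value)
        if not new_domains[n]:
            return None                  # dead end detected before recursing
    return new_domains

domains = {r: {"Red", "Green", "Blue"} for r in ["WA", "NT", "SA"]}
neighbors = {"WA": ["NT", "SA"]}
pruned = forward_check("WA", "Red", domains, neighbors)
print(pruned["NT"], pruned["SA"])  # Red removed from both
```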
Maintaining Arc Consistency (MAC)
MAC combines backtracking search with arc consistency.
Whenever a variable is assigned:
• Arc consistency is enforced on remaining variables.
Advantages:
• Stronger pruning of search space
• Often much faster than simple backtracking
9.4 Variable & Value Ordering Heuristics
Heuristics significantly improve CSP search efficiency.
MRV (Minimum Remaining Values)
MRV selects the variable with the smallest remaining domain.
Example:
Variable A → {1,2}
Variable B → {1,2,3,4}
MRV chooses variable A.
Reason:
It is most constrained and likely to cause failure earlier.
DEG (Degree Heuristic)
The degree heuristic selects the variable involved in the largest number of constraints with other variables.
Example:
A variable connected to many neighbors is selected first.
This helps reduce future conflicts.
LCV (Least Constraining Value)
LCV chooses the value that rules out the fewest choices for neighboring variables.
Example:
If assigning value 2 eliminates fewer possibilities than assigning value 1, then choose value 2.
Advantages:
• Maintains flexibility in remaining variables
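The MRV and LCV heuristics described above can be sketched as small selection functions; the names and data layout are illustrative assumptions:

```python
def mrv(domains, assigned):
    """MRV: pick the unassigned variable with the fewest remaining values."""
    return min((v for v in domains if v not in assigned),
               key=lambda v: len(domains[v]))

def lcv(var, domains, neighbors):
    """LCV: order var's values by how few neighbor values each rules out."""
    def ruled_out(value):
        return sum(value in domains[n] for n in neighbors.get(var, []))
    return sorted(domains[var], key=ruled_out)

domains = {"A": {1, 2}, "B": {1, 2, 3, 4}}
print(mrv(domains, assigned=set()))  # "A": smallest remaining domain
```

In a full solver, `mrv` chooses which variable to branch on next, and `lcv` decides the order in which that variable's values are tried.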
9.5 CSP vs Classical Planning – Connections
Although CSPs and classical planning address different types of problems, they share several similarities.
Similarities
Both involve:
• State representation
• Constraints
• Search algorithms
Both approaches aim to find solutions that satisfy certain conditions.
Differences
Aspect | CSP | Classical Planning
Representation | Variables and domains | States and actions
Goal | Consistent assignments | Sequence of actions
Search space | Variable assignments | State transitions
Example
Scheduling problem:
Assign times to tasks without conflicts.
This can be modeled as:
• CSP → assigning time slots
• Planning → sequence of task executions
In some domains, planning problems can be transformed into CSP formulations.
Conclusion
Constraint Satisfaction Problems provide a powerful framework for solving problems that involve assigning values while respecting constraints. By representing problems in terms of variables, domains, and constraints, CSP techniques enable efficient reasoning about complex decision spaces.
Algorithms such as backtracking search, forward checking, and arc consistency (AC-3) help reduce the search space, while heuristics such as MRV, degree heuristic, and least-constraining value significantly improve efficiency.
CSP methods are widely used in scheduling, resource allocation, puzzle solving, and planning systems, making them an essential component of modern Artificial Intelligence problem-solving techniques.
Chapter 10: Evaluation, Benchmarks & Practical Considerations
Developing planning and search algorithms is only one part of building intelligent systems. Equally important is evaluating algorithm performance, testing on benchmark domains, and understanding real-world implementation challenges.
Researchers and practitioners measure how well planning algorithms perform using standardized benchmarks and evaluation metrics. These evaluations help compare different planning systems and determine their suitability for practical applications.
Planning systems are widely tested in domains such as robotics navigation, logistics planning, puzzle solving, and automated reasoning.
10.1 Standard Benchmarks
Benchmark problems allow researchers to test planning algorithms under consistent and comparable conditions.
Several well-known benchmark environments are widely used in AI research.
International Planning Competition (IPC)
The International Planning Competition (IPC) is one of the most important benchmarks for evaluating planning systems.
It provides standardized planning domains defined in PDDL (Planning Domain Definition Language).
Common IPC domains include:
• Logistics planning
• Satellite scheduling
• Block stacking
• Rover navigation
Planners such as Fast Downward, LAMA, and FF are often evaluated using IPC benchmarks.
Gridworld
Gridworld is a simple environment where an agent moves in a grid to reach a goal.
Example:
S → → → G
Where:
S = start state
G = goal state
Gridworld is commonly used to test search algorithms, reinforcement learning agents, and planning systems.
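Breadth-first search on a small Gridworld can be sketched as follows; the grid layout, wall symbol `#`, and function name are illustrative assumptions:

```python
from collections import deque

def bfs(grid, start, goal):
    """Return the shortest list of cells from start to goal, or None."""
    rows, cols = len(grid), len(grid[0])
    frontier = deque([(start, [start])])
    seen = {start}
    while frontier:
        (r, c), path = frontier.popleft()
        if (r, c) == goal:
            return path
        for dr, dc in [(-1, 0), (1, 0), (0, -1), (0, 1)]:  # four moves
            nr, nc = r + dr, c + dc
            if (0 <= nr < rows and 0 <= nc < cols
                    and grid[nr][nc] != "#" and (nr, nc) not in seen):
                seen.add((nr, nc))
                frontier.append(((nr, nc), path + [(nr, nc)]))
    return None

grid = [". . . G".split(),
        ". # # .".split(),
        "S . . .".split()]
path = bfs(grid, start=(2, 0), goal=(0, 3))
print(len(path) - 1)  # number of moves in the shortest path: 5
```

Because every move costs the same, BFS is guaranteed to return a shortest path here.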
Sokoban
Sokoban is a puzzle game where an agent pushes boxes to target locations.
Example problem:
Move boxes to goal squares without blocking paths.
Challenges:
• Large search space
• Deadlock situations
Sokoban is widely used to evaluate heuristic search algorithms.
Logistics Domain
Logistics problems involve transporting packages between locations using vehicles.
Example actions:
• Load package into truck
• Move truck to another city
• Unload package
This domain is useful for evaluating planning systems and scheduling algorithms.
10.2 Performance Metrics
Several metrics are used to evaluate planning algorithms.
Plan Length
Plan length refers to the number of actions required to achieve the goal.
Example:
Plan:
Load package
Move truck
Unload package
Plan length = 3.
Shorter plans often indicate better solutions.
Makespan
Makespan measures the total execution time of a plan, considering parallel actions.
Example:
Two actions executed simultaneously reduce makespan.
Makespan is important in temporal planning problems.
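Given start times and durations, makespan is simply the latest finish time across all actions. A tiny sketch with made-up numbers:

```python
def makespan(schedule):
    """schedule: list of (start_time, duration) pairs; parallel actions allowed."""
    return max(start + duration for start, duration in schedule)

# Two actions running in parallel finish at t = 4, not t = 7:
print(makespan([(0, 3), (0, 4)]))  # 4
# The same actions run sequentially finish at t = 7:
print(makespan([(0, 3), (3, 4)]))  # 7
```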
CPU Time
CPU time measures how long the algorithm takes to compute a solution.
Example:
Planner A → 2 seconds
Planner B → 10 seconds
Lower CPU time indicates faster algorithms.
Memory Usage
Memory consumption is another important metric.
Some algorithms require storing large numbers of states.
Example:
Breadth-first search may require large memory for storing frontier nodes.
Efficient planners balance time and memory usage.
10.3 Optimality vs Speed Trade-offs
In many planning problems, there is a trade-off between solution optimality and computational speed.
Optimal Planning
Optimal planners guarantee the best possible solution.
Example:
A* search with admissible heuristics.
Advantages:
• Guaranteed best plan
Disadvantages:
• High computational cost.
Approximate Planning
Approximate planners produce good solutions quickly but may not guarantee optimality.
Example:
Greedy search or hill climbing.
Advantages:
• Faster computation
• Suitable for real-time applications
Example:
Autonomous robots often prioritize speed over optimality.
10.4 Handling Uncertainty & Partial Observability
Real-world environments often contain uncertainty and incomplete information.
Planning systems must handle:
• Non-deterministic actions
• Partial observability
• Uncertain outcomes
Non-Deterministic Planning
In non-deterministic environments, actions may produce multiple possible outcomes.
Example:
Robot navigation.
Move forward may result in:
• Successful movement
• Slippage
Planners must consider all possible outcomes.
Partial Observability
In partially observable environments, the agent cannot fully observe the current state.
Example:
Robot exploring an unknown environment.
The agent must reason with incomplete information.
Introduction to POMDPs
Partially Observable Markov Decision Processes (POMDPs) provide a mathematical framework for decision-making under uncertainty.
Key components:
• States
• Actions
• Observations
• Rewards
The agent maintains a belief state, representing a probability distribution over possible states.
POMDPs are used in:
• Robotics navigation
• Autonomous systems
• Medical decision support
10.5 Implementation Tips
Practical implementation of planning algorithms often relies on specialized software libraries.
Python Tools
Python offers several libraries for implementing AI search and planning algorithms.
simpleai
A lightweight Python library for search algorithms.
Example:
from simpleai.search import astar
Supports:
• A* search
• BFS and DFS
• Local search algorithms
unified-planning
Unified Planning is a modern Python framework for automated planning.
Features include:
• PDDL support
• Multiple planners
• Planning problem modeling
Example:
from unified_planning.shortcuts import *
This library simplifies building and solving planning problems.
C++ Planning Systems
Many high-performance planners are implemented in C++.
Fast Downward
Fast Downward is a state-of-the-art planning system.
Features:
• Heuristic search planning
• Landmark heuristics
• PDDL support
It is used in many planning competitions.
OpenCog
OpenCog is an AI framework that integrates:
• Reasoning
• Planning
• Knowledge representation
It is designed for building advanced cognitive systems.
Conclusion
Evaluating planning and search algorithms requires standardized benchmarks and meaningful performance metrics. Domains such as Gridworld, Sokoban, and logistics planning provide controlled environments for testing algorithms, while competitions like the International Planning Competition (IPC) help advance planning research.
Performance metrics such as plan length, makespan, CPU time, and memory usage allow researchers to compare algorithm efficiency and effectiveness. In real-world applications, planners must often balance optimality with computational speed.
Modern planning systems must also address challenges such as uncertainty, partial observability, and dynamic environments. Tools such as simpleai, unified-planning, Fast Downward, and OpenCog provide practical platforms for implementing and experimenting with planning algorithms.
Together, these evaluation methods and tools support the development of robust, scalable, and practical AI planning systems for complex real-world problems.
Chapter 11: Applications & Case Studies
Artificial Intelligence search and planning techniques are widely applied across many real-world domains. From robotics and autonomous vehicles to logistics and game AI, these algorithms help systems make intelligent decisions, plan sequences of actions, and optimize complex processes.
This chapter explores several practical applications of search and planning algorithms and demonstrates how they are used in real-world systems.
11.1 Robotics: Motion Planning, Task & Motion Planning (TAMP)
Robotics is one of the most important application areas of AI planning.
Robots must plan how to move, interact with objects, and complete tasks efficiently.
Motion Planning
Motion planning focuses on finding a path for a robot to move from one location to another while avoiding obstacles.
Example:
Robot navigation in a warehouse.
Initial State → Robot position A
Goal State → Robot position B
Common algorithms used include:
• A* search
• Rapidly-exploring Random Trees (RRT)
• Probabilistic Roadmaps (PRM)
These algorithms ensure that robots move safely through complex environments.
Task & Motion Planning (TAMP)
Task and Motion Planning integrates symbolic planning with geometric motion planning.
Example scenario:
Robot assembling a product.
Tasks:
Pick up component
Move arm
Attach component
The planner must consider:
• Task sequence
• Robot arm movement
• Collision avoidance
TAMP systems combine planning techniques with motion planning algorithms to create realistic robotic behaviors.
11.2 Autonomous Vehicles & Drones: Path Planning
Autonomous vehicles rely heavily on search and planning algorithms to navigate safely.
Path Planning
The vehicle must compute the best route from its current position to a destination while avoiding obstacles and traffic conditions.
Example algorithms:
• A* search
• Dijkstra's algorithm
• Hybrid A*
These algorithms consider:
• Road networks
• Traffic rules
• Dynamic obstacles
Drone Navigation
Drones require three-dimensional path planning.
Example tasks:
• Delivery route optimization
• Surveillance operations
• Environmental monitoring
Planning systems help drones determine energy-efficient and collision-free flight paths.
11.3 Games & Virtual Agents: NPC Behavior, Strategy Games
Artificial Intelligence techniques are widely used in video games to control non-player characters (NPCs) and strategy systems.
NPC Behavior
Game AI often uses planning systems to generate intelligent behaviors.
Example:
An enemy character in a game may:
Search for the player
Hide behind cover
Attack strategically
Planning algorithms help generate believable behaviors.
Strategy Games
Complex games such as real-time strategy games require AI systems to manage:
• Resource collection
• Unit movement
• Combat strategies
Algorithms used include:
• Minimax search
• Monte Carlo Tree Search
• Reinforcement learning
These techniques enable AI agents to compete effectively against human players.
11.4 Logistics & Scheduling
Planning algorithms are widely used in industrial and transportation systems.
Job-Shop Scheduling
In manufacturing, machines must process tasks in a specific order.
Example:
Machine A → Task 1
Machine B → Task 2
Constraints:
• Limited resources
• Time deadlines
• Task dependencies
Planning algorithms optimize schedules to minimize production time.
Vehicle Routing
Vehicle routing problems involve determining the most efficient delivery routes.
Example:
Delivery trucks distributing packages across multiple locations.
Objectives:
• Minimize travel distance
• Reduce fuel consumption
• Meet delivery deadlines
Search algorithms and optimization techniques help generate efficient delivery plans.
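As a toy illustration (not a production routing method), a greedy nearest-neighbor heuristic builds one delivery route by always visiting the closest remaining stop; the coordinates and function name are made up:

```python
import math

def nearest_neighbor_route(depot, stops):
    """Greedy single-vehicle route: repeatedly visit the nearest stop."""
    route, remaining, current = [depot], set(stops), depot
    while remaining:
        nxt = min(remaining, key=lambda p: math.dist(current, p))
        route.append(nxt)
        remaining.remove(nxt)
        current = nxt
    return route

route = nearest_neighbor_route((0, 0), [(2, 0), (1, 0), (3, 0)])
print(route)  # stops visited in order of distance: (1,0), (2,0), (3,0)
```

Real vehicle routing systems add capacity and time-window constraints and use much stronger optimization; this sketch only shows the greedy idea.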
11.5 Natural Language Generation & Dialogue Planning
Planning techniques are also used in natural language processing systems.
Natural Language Generation (NLG)
NLG systems generate human-readable text from structured data.
Example:
Weather report generation.
Input:
Temperature = 28°C
Condition = Sunny
Output:
"Today will be sunny with a temperature of 28 degrees."
Planning algorithms determine:
• Content selection
• Sentence structure
• Logical ordering of information
Dialogue Planning
Dialogue systems such as chatbots must plan conversations to achieve specific goals.
Example:
Customer support chatbot.
Conversation plan:
Identify user issue
Provide troubleshooting steps
Confirm resolution
Planning systems help maintain coherent and goal-directed dialogues.
11.6 End-to-End Project Examples
Practical projects help demonstrate how AI planning and search techniques work in real systems.
8-Puzzle Solver
The 8-puzzle is a classic search problem.
Goal:
Arrange tiles in correct order.
Example initial state:
1 2 3
4 5 6
_ 7 8
Search algorithms such as A* are commonly used to solve the puzzle efficiently.
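A compact A* solver for the 8-puzzle with the Manhattan-distance heuristic, using a 9-tuple state with 0 for the blank. The start state matches the example above; this is a sketch under those assumptions, not an optimized solver:

```python
import heapq

GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)

def manhattan(state):
    """Sum of each tile's distance from its goal position."""
    return sum(abs(i // 3 - (v - 1) // 3) + abs(i % 3 - (v - 1) % 3)
               for i, v in enumerate(state) if v != 0)

def neighbors(state):
    """Yield states reachable by sliding a tile into the blank."""
    i = state.index(0)
    r, c = divmod(i, 3)
    for dr, dc in [(-1, 0), (1, 0), (0, -1), (0, 1)]:
        nr, nc = r + dr, c + dc
        if 0 <= nr < 3 and 0 <= nc < 3:
            j = nr * 3 + nc
            s = list(state)
            s[i], s[j] = s[j], s[i]
            yield tuple(s)

def astar(start):
    """Return the length of an optimal solution (f = g + h ordering)."""
    frontier = [(manhattan(start), 0, start)]
    best_g = {start: 0}
    while frontier:
        f, g, state = heapq.heappop(frontier)
        if state == GOAL:
            return g
        for nxt in neighbors(state):
            if g + 1 < best_g.get(nxt, float("inf")):
                best_g[nxt] = g + 1
                heapq.heappush(frontier, (g + 1 + manhattan(nxt), g + 1, nxt))
    return None

print(astar((1, 2, 3, 4, 5, 6, 0, 7, 8)))  # 2: slide 7 left, then 8 left
```

Because Manhattan distance is admissible, the first time A* pops the goal, the move count is optimal.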
Blocks World Planner
Blocks World is a well-known planning problem.
Example:
Blocks stacked on a table must be rearranged to achieve a target configuration.
Actions include:
• Pick up block
• Place block on another block
Planning algorithms generate sequences of actions that achieve the goal configuration.
Simple Game AI
Game AI projects demonstrate adversarial search techniques.
Example:
Tic-Tac-Toe AI using minimax.
Steps:
Generate possible moves
Evaluate board states
Choose optimal move
This type of project illustrates how AI algorithms make strategic decisions.
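The steps above can be sketched as a plain minimax for Tic-Tac-Toe; the board encoding (a list of 9 cells) and scoring convention (+1 for an X win, -1 for O, 0 for a draw) are illustrative choices:

```python
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),          # rows
         (0, 3, 6), (1, 4, 7), (2, 5, 8),          # columns
         (0, 4, 8), (2, 4, 6)]                     # diagonals

def winner(board):
    for a, b, c in LINES:
        if board[a] and board[a] == board[b] == board[c]:
            return board[a]
    return None

def minimax(board, player):
    """Return (score, move) from X's viewpoint; X maximizes, O minimizes."""
    w = winner(board)
    if w:
        return (1 if w == "X" else -1), None
    moves = [i for i, cell in enumerate(board) if cell is None]
    if not moves:
        return 0, None                             # board full: draw
    results = []
    for m in moves:                                # generate possible moves
        board[m] = player
        score, _ = minimax(board, "O" if player == "X" else "X")
        board[m] = None                            # evaluate, then undo
        results.append((score, m))
    return max(results) if player == "X" else min(results)  # choose optimal move

# X can win immediately by completing the top row at cell 2:
board = ["X", "X", None,
         "O", "O", None,
         None, None, None]
score, move = minimax(board, "X")
print(score, move)  # 1 2
```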
Conclusion
Artificial Intelligence planning and search techniques play a crucial role in solving real-world problems across diverse domains. From robotics and autonomous vehicles to logistics optimization and game AI, these algorithms enable systems to reason about actions, anticipate outcomes, and generate effective strategies.
Applications such as motion planning, job scheduling, and dialogue management demonstrate how theoretical AI concepts translate into practical solutions. End-to-end projects like the 8-puzzle solver and Blocks World planner further illustrate how search and planning algorithms can be implemented in real systems.
As AI technology continues to evolve, planning techniques will remain fundamental for building intelligent systems capable of complex decision-making and autonomous operation.
Chapter 12: Current Trends & Open Research Directions (2025–2026)
Artificial Intelligence planning and search research continues to evolve rapidly. With the rise of deep learning, large language models, and hybrid AI systems, modern research focuses on combining traditional symbolic reasoning with data-driven learning approaches.
These emerging directions aim to make planning systems more scalable, adaptive, and capable of handling real-world complexity. This chapter highlights major research trends shaping AI planning between 2025 and 2026.
12.1 Neural Heuristics & Learning to Search
Traditional planning systems rely on handcrafted heuristics to guide search algorithms. However, designing effective heuristics manually can be difficult for complex domains.
Recent research focuses on learning heuristics using neural networks.
Neural Heuristic Models
Neural networks can learn heuristic functions by analyzing large datasets of solved planning problems.
Instead of manually designing a heuristic function h(n), a neural network approximates the function:
h(n) ≈ NeuralNetwork(n)
Advantages:
• Automatic heuristic generation
• Improved performance in complex domains
• Ability to generalize across problem instances
Learning to Search
Learning-to-search methods train models to predict promising actions during search.
Example:
Instead of exploring every possible action, a learned model predicts the most useful actions to expand.
Applications include:
• Robotics task planning
• Automated theorem proving
• Game playing systems
12.2 Large Language Models as Planners
Large Language Models (LLMs) have recently demonstrated impressive reasoning capabilities. Researchers are exploring how these models can assist or perform planning tasks.
ReAct Framework
ReAct (Reason + Act) combines reasoning with action execution.
The system alternates between:
• Reasoning about the problem
• Executing actions in the environment
Example workflow:
Thought → Action → Observation → Next Thought
This enables LLMs to interact with external tools and environments.
Plan-and-Solve Prompting
Plan-and-Solve prompting encourages LLMs to first generate a high-level plan and then execute each step sequentially.
Example:
Problem: Solve a complex task.
Step 1: Generate plan
Step 2: Execute each subtask
Advantages:
• Reduces reasoning errors
• Improves problem-solving accuracy
LLMs are increasingly used in autonomous agents and intelligent assistants.
12.3 Neuro-Symbolic Planning
Neuro-symbolic AI integrates neural networks with symbolic reasoning systems.
Traditional symbolic planners excel at logical reasoning but struggle with perception tasks. Neural networks excel at perception but lack explicit reasoning abilities.
Combining both approaches provides powerful capabilities.
Hybrid Architecture
Example system:
Neural Network → Perception and pattern recognition
Symbolic Planner → Logical reasoning and planning
Example application:
Autonomous robots.
Steps:
Neural network detects objects in environment
Symbolic planner decides sequence of actions
Advantages:
• Combines learning and reasoning
• Improves explainability
• Handles complex real-world environments
12.4 Scalable Planning in Real-World Domains
One of the major challenges in AI planning is scalability.
Real-world systems involve thousands or millions of possible states.
Researchers are developing planning methods for large-scale domains such as:
Energy Systems
AI planners optimize power grid operations.
Example tasks:
• Balancing energy supply and demand
• Scheduling renewable energy sources
• Preventing system failures
Planning algorithms help improve energy efficiency and sustainability.
Healthcare Systems
Planning techniques assist healthcare systems in:
• Patient scheduling
• Resource allocation
• Treatment planning
Example:
Hospital planning system assigning doctors and equipment to patients.
Climate Modeling
AI planning systems help simulate and optimize environmental policies.
Example:
• Carbon reduction strategies
• Disaster response planning
• Sustainable resource management
These applications require planners that can handle massive datasets and complex constraints.
12.5 Integration with Reinforcement Learning
Modern AI systems increasingly combine search algorithms with reinforcement learning.
This hybrid approach improves decision-making in complex environments.
Search + Reinforcement Learning
Search algorithms help explore possible solutions, while reinforcement learning improves strategies over time.
Example:
AlphaGo system.
Components:
• Neural networks
• Monte Carlo Tree Search
• Reinforcement learning
The combination allowed AlphaGo to achieve superhuman performance in the game of Go.
Benefits of Hybrid Systems
• Faster learning
• Better decision-making
• Improved generalization
Hybrid search and reinforcement learning systems are now widely used in:
• Robotics
• Autonomous vehicles
• Strategic games
• Resource management
Conclusion
Recent advances in Artificial Intelligence are transforming planning and search research. Neural heuristics and learning-to-search techniques enable planners to automatically learn better search strategies. Large Language Models are beginning to perform planning tasks using frameworks such as ReAct and Plan-and-Solve prompting.
Neuro-symbolic systems combine neural perception with symbolic reasoning, enabling more robust AI systems. Meanwhile, scalable planning methods are being applied to real-world domains such as energy management, healthcare systems, and climate modeling.
Finally, hybrid approaches integrating search algorithms with reinforcement learning have demonstrated remarkable success in complex environments. These developments represent some of the most promising research directions for the future of intelligent planning systems and autonomous decision-making technologies.