Hybrid
Controls for Integrated Planning and Control
|
Research Description: |
|||
|
Instead of
attempting to design a complicated global control-law, much simpler
control policies are defined over local regions. Composing these
relatively simple policies induces a piecewise continuous vector field
over the union of the policy domains. A policy is
specified by a configuration dependent vector field defined over a
local region of the robot's free configuration space, termed a cell.
From this vector field and knowledge of the current robot state, the
control inputs are determined such that the closed-loop dynamics flows
to the specified policy goal set within the cell.
A discrete
transition relation, which can be represented by a graph, is induced by
the transition between the domain of one policy to the domain of a
second policy containing the goal set of the first policy.
On-line planning, and re-planning under changing conditions, becomes
more tractable on the graph, allowing us to bring numerous discrete
planning tools to bear on this essentially continuous problem. By
sequencing the local policies according to the ordering determined by a
discrete planner, the closed loop dynamics induce the discrete
transitions desired by the discrete plan. The overall hybrid
(switched) control policy responds to system perturbations without the
need for re-planning. In the face of changing environmental
conditions, the discrete graph allows for fast online re-planning,
while continuing to respect the system constraints. By
coupling planning and control in this way, the hybrid control system
plans in the discrete space of control policies; thus, the |
|||
|
Simple Systems: |
|||
|
Our early work focused on
simple systems where the robot was considered a fully actuated point in
its configuration space. Two basic policy types were developed
over convex polytopes. The first type of policy generated a
vector field whose integral curves flowed through a designated goal
set, as shown in the left-hand figure below. The second policy
type used a vector field that converged to a single point contained in
the interior. For kinematic systems, the control law
simply followed the vector field. For second-order systems, the
control law converged to the vector field flow in a way that guaranteed
the system did not exit the cell except via the designated goal
set. By properly scaling the vector field, the control policies
could be applied to systems with velocity and/or acceleration
constraints. Given a convex decomposition of the free configuration space,
these two types of policies can be composed to generate a globally
convergent switched control policy as shown on the right-hand figure
below. By allowing overlapping policy cells, the global policy domain
in state space can be increased for second-order systems. While a
small region may not be able to steer a high speed initial condition, a
larger region may be able to capture and steer the initial
condition. Consider the cases shown in the three figures
below. The same policy deployment and ordering is used for
three different initial conditions. In the left-hand figure, the
system starts from rest and switches through three policies before
converging to the goal. For the middle figure, the system starts
with a non-zero velocity that requires a different initial policy; this
results in a different induced path. Finally, the right-hand
figure starts with high initial speed. The policy defined over
the large rectangle to the left of the central obstacle applies braking
and steering to capture the system and bring it into the domain of
another policy. At this point, the system takes the path around
the central obstacle. This policy-based switching is
based entirely on the state of the system, and the ordering of the
policies. The system is reactive and does not require
re-planning. In these plots, the large red dots indicate a dedicated braking control law, the blue dots indicate the basic control that steers to a specific facet of the local convex region. The green dots indicate that a convergent control policy is active. |
|||
|
Nonholonomic Convex-bodied
Systems: |
|||
|
Our current work seeks to
extend the results to wheeled mobile robots, where the wheel-to-ground
contact induces nonholonomic velocity constraints on the system.
Realistic wheeled mobile robots also have constraints on speed and
acceleration that must be considered by the control
policies. Finally, many robots are non-circular;
therefore, the control policies must consider the robot shape and
orientation in cluttered environments.
As in the simple case, the local feedback control policies are defined over local regions of the robot configuration space, which we term cells. Each cell has a designated goal set; the cells are deployed such that the goal set of one policy is contained in the interior of another cell. Over each cell, the local control policy induces a vector field whose flow leads to the designated goal set. In order to realize the sequential composition approach for these nonholonomic systems with multiple constraints, each policy must satisfy four requirements. First, the cell must have simple representations that allow fast inclusion tests. That is, the test to see if a particular policy is valid must be fast. Second, the cell must be contained in the free configuration
space. This can be tested by expanding the cell to account for
the extent of the robot body at each orientation on the cell boundary,
and projecting this expanded cell into the workspace. The
projected set, which represents the swept volume of the robot body over
the configurations in the cell, is then tested for intersection with a
workspace obstacle. We have developed a non-conservative analytic
mapping for determining the expanded cell given a piecewise analytic
cell boundary and analytic definition of the robot body; see CMU-RI-TR-06-34
for more details. Third, given the input constraints and nonholonomic velocity constraints, it must be possible to generate a velocity that keeps the system within the cell on the cell boundary, with the exception of points contained in the goal set. This guarantees that the control system can keep the robot configuration within the cell. Coupled with the free space requirement in the second condition, this guarantees that the policy can be executed in a way that avoids collision. Finally, the control policy must guarantee convergence to some configuration in the designated goal set in finite time for any point on the cell interior. This requirement guarantees that the ordered policies will transition to a higher priority policy, and thus eventually reach the overall goal. This research has developed policies that satisfy these four
requirements, enabling the implementation of the approach on the robot
pictured above. The policies have been deployed in the robot
environment shown below, and the discrete policy graph that represents
transitions between policies specified. A partial deployment of
the policies is shown below. Given the policy graph encoding the discrete transition
relations among policies, a discrete planner determines an ordering of
the policies given an overall goal. Given an initial condition
for the robot, the control system searches for the highest priority
policy that contains the initial condition. From then on, the
policies are executed according to the specified ordering. An
explicit path is never planned; the resulting closed loop dynamics
induce a path that satisfies the system constraints and reaches the
overall goal. Results for four initial conditions, executed
on the non-circular laboratory mobile robot, are shown at the top of
this page. Two additional experimental runs are shown below. The policies and policy graph are also amenable to use with discrete planners using temporal logic to specify multiple goals, as detailed in CMU-RI-TR-06-34. |
|
Personnel: |
|
Publications: |
|
Composition of Local
Potential Functions for Global Robot Control and Navigation (IROS '03)
|
|
|
Last
updated February 29, 2008
©
Copyright 2006-2008
David C. Conner, All Rights Reserved.