PAGE 1
REACHABLE SET CONTROLFORPREFERREDAXIS HOMING MISSILESByDONALDJ.CAUGHLIN,JR. A DISSERTATION PRESENTED TOTHEGRADUATESCHOOL OFTHEUNIVERSITYOFFLORIDA IN PARTIALFULFILLMENTOFTHEREQUIREMENTS FORTHEDEGREEOFDOCTOR OF PHILOSOPHY UNIVERSITYOFFLORIDA1988
PAGE 2
Copyright 1988ByDONALD J.CAUGHLINJR.
PAGE 3
To BarbaraAmyJon
PAGE 4
ACKNOWLEDGMENTS The author wishestoexpress his gratitude to his committee chairman, Dr. T.E Bullock, for his instruction, helpful suggestions, and encouragement. Appreciationisalso expressed for the support and many helpful commentsfromthe other committee members, Dr. Basile, Dr. Couch, Dr. Smith, and Dr. Svoronos. iv
PAGE 5
TABLE OF CONTENTS ACKNOWLEDGMENTSivLIST OF FIGURES viiKEYTO SyMBOLS ix ABSTRACTxivCHAPTER I INTRODUCTION 1 IIBACKGROUND4 Missile Dynamics 5 Linear Accelerations 6 Moment Equations (i Linear Quadratic Gaussian Control Law 7 III CONSTRAINED CONTROL13IV CONSTRAINED CONTROL WITH UNMODELED SETPOINT ANDPLANTVARIA TIONS25Linear Optimal Control with Uncertainty and Constraints31Control Technique32Discussion36Procedure.37V REACHABLE SET CONTROL EXAMPLE.41Performance Comparison Reachable SetandLQGControl....41Summary.54VI REACHABLE SET CONTROL FORPREFERREDAXIS HOMING MISSILES55Acceleration Control.56System Model.56Disturbance Model58Reference Model.60Roll Control. 62 Definition 62 Controller 66 Kalman FiIter67Reachable Set Controller68Structure 68 Application 72 v
PAGE 6
VII RESULTS AND DISCUSSION76Simulation77Trajectory Parameters78Results78Deterministic Results78Stochastic Results81Conclusions87Reachable Set Control. 87 Singer Model. 87APPENDIXA SIMULATION RESUL TS88BSAMPLED-DATA CONVERSION 94 System Model 94 Sampled Data Equations 96 System 96 Target Disturbance 98 Minimum Control Reference 99 Summary 1 00 C SAMPLED DATA COST FUNCTIONS101DLQGCONTROLLERDECOMPOSITION.107 ECONTROLLERPARAMETERS111Control Law111Filter 112 LISTOFREFERENCES 113 BIOGRAPHICAL SKETCH 117 vi
PAGE 7
LISTOFFIGURES Figure Page2.1Missile Reference System 44.1Feedback System and Notation284.2 Reachable Set Control Objective334.3 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and Symmetric Constraints.384.4 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and U nsymmetric Constraints385.1Terminal PerformanceofLinear Optimal Control...435.2 Initial AccelerationofLinear Optimal Control.435.3 Linear Optimal AccelerationvsTime455.4 Linear Optimal VelocityvsTime455.5 Linear Optimal PositionvsTime lt6 5.6 Unconstrained and Constrained Acceleration .47 5.7 Unconstrained and Constrained Velocity vs Time.485.8 Unconstrained and Constrained PositionvsTime.485.9 Acceleration Profile With and Without Target Set Uncertainty 50 5.10 VelocityvsTime With and Without Target Set Uncertainty 505.11PositionvsTime With and Without Target Set Uncertainty515.12 AccelerationvsTime LQG and Reachable Set Control. 52 vii
PAGE 8
5.13 VelocityvsTime LQG and Reachable Set Control.535.14 PositionvsTime LQG and Reachable Set Control.536.1Reachable Set Control Disturbance processes606.2. Roll Angle Error Definition from Seeker Angles636.3. Roll Control Zones 65 6.4 Target Missile System746.5 Command Generator/Tracker757.1RMS Missile Acceleration767.2 Engagement Geometry777.3 Deterministic Results807.4 Stochastic Results817.5 MeasuredvsActual Z Axis Velocity 84 7.6 Performance Using Position Estimates and Actual Velocities86A.I XY Missile&Target Positions Reachable Set Control.89A.2 XY Missile&Target Positions Baseline Control Law89A.3XZMissile&Target Positions Reachable Set Control.90AAXZMissile&Target Positions Baseline Control Law 90 A.5 Missile Acceleration Reachable Set Control...91A.6 Missile Acceleration Baseline Control Law91A.7 Missile Roll Commands&Rate Reachable Set Control.92A.8 Missile Roll Commands&Rate Baseline Control Law92A.9 Missile Roll Angle Error93viii
PAGE 9
a()B()q.)D()DODOwtDoc DocDq DO E() F() G()KEYTO SYMBOLS Reference controlinputvector. Missile inertial x axis acceleration. Target inertial x axis acceleration. Specific force (drag) along X body axis. Desired linear acceleration about Z and Y body axes. Reference controlinputmatrix. Reference stateoutputmatrix. Feedforward stateoutputmatrix. Stability parameter Equilibrium drag coefficient. Stability parameter Change in drag due to weight. Stability parameter Changeindrag due to velocity. Stability parameter Change in drag due to angleofattack. Stability parameter Change in drag due to angleofattack rate. Stability parameter Change in drag due to pitch rate. Stability parameter Change in drag due to pitch angle. Stability parameter Changeindrag due to pitch canard deflection angle. Feedforward referenceoutputmatrix. Roll angle error. System matrix describing the dynamic interaction between state variables. System controlinputmatrix. Optimal control feedback gain matrix. ix
PAGE 10
G l(ti) Optimal system state feedback gain matrix. G2(ti) Optimal target state feedback gain matrix. G3(ti) Optimal reference state feedback gain matrix. g Acceleration duetogravity. H(.) System state output matrix. Ixx,Iyy,Izz Momentofinertial withrespect to the given axis.JCosttogofunction for the mathematical optimization. L() System noise input matrix.LOStability parameter Equilibrium change in Z axis velocity.LOwt Stability parameter Change in Z axis velocity duetoweight. L u Stability parameter Change in Z axis velocity duetoforward velocity.LexLexStability parameter Change in Z axis velocity duetoangleofattack. Stability parameter Change in Z axis velocity due to angleofattack rate. Stability parameter Change in Z axis velocity duetopitch rate. Stability parameter Change in Z axis velocity duetopitch angle. Stability parameter Change in Z axis velocity duetopitch canard deflection angle. Stability parameter Equilibrium change in roll rate. Stability parameter Change in roll rate duetosideslip angle. Stability parameter Change in roll rate due to sideslip angle rate. Stability parameter Change in roll rate due to roll rate. Stability parameter Change in roll rate due to yaw rate. Stability parameter Change in roll rate due to roll canard deflection angle. Stability parameter Change in roll rate due to yaw canard deflection angle.x
PAGE 11
M Massofthemissile.MOStabilityparameter Equilibriumpitchrate. M u StabilityparameterChangeinpitchrateduetoforwardvelocity.McxStabilityparameterChange inpitchratedueto angleofattack.McxStabilityparameterChange inpitchratedueto angleofattackrate. M q StabilityparameterChange inpitchrateduetopitchrate. MOe StabilityparameterChangeinpitchrateduetopitchcanarddeflectionangle.NOStabilityparameter-Equilibriumyaw rate. N B Stabilityparameter Change in yaw ratedueto sideslip angle. N B StabilityparameterChange in yaw ratedueto sideslip angle rate. N p Stabilityparameter Change in yawratedueto roll rate. N r StabilityparameterChange in yawrateduetoyawrate. NOa StabilityparameterChangeinyawratedueto rollcanarddeflectionangle. NOr StabilityparameterChangeinyawratedueto yawcanarddeflectionangle.Nx,Ny,NzComponentsofappliedaccelerationonrespective missilebodyaxis. P Solution totheRiccatiequation.P,Q,RAngularratesabouttheX,Y,andZbodyaxis respectively.Q(.)State weightingmatrix.R(.)Controlweightingmatrix.R(.)Referencestate vector. S()State-Controlcross weighting matrix. T(.)Targetdisturbancestate vector. TgoTime-to-go.U Systeminputvector. xi
PAGE 12
V,V,W Linear velocities with respect to the X,Y,andZbodyaxis respectively.vs System noise process . Zero mean white Gaussian noise modeling uncorrelated state disturbances. Zero mean white Gaussian noise driving firstorderMarkov process modeling correlated state disturbances. IVtotlX(.)X,Y,ZYoYOwtex B Total missile velocity. System state vector. Body stabilized axis. Stability parameter Equilibrium change inYaxis velocity. Stability parameter Change inYaxis velocity due to weight. Stability parameter Change in Y axis ratedueto sideslip angle. Stability parameter Change in Y axis velocitydueto sideslip angle rate. Stability parameter ChangeinY axis velocitydueto roll rate. Stability parameter ChangeinY axis velocity due to yaw rate. Stability parameter ChangeinY axis velocity due to roll angle. Stability parameter Change in Y axis velocity due to rollcanarddeflection angle. Stability parameter Change in Y axis velocity due to yawcanarddeflection angle. Angleofattack. AngleofSideslip. System noise transition matrix. Reference state transition matrix. Target disturbance state transition matrix. System state transition matrix. xii
PAGE 13
Target model correlation time. Target elevation aspect angle. Seeker elevation gimbal angle. Target azimuth aspect angle. Seeker azimuth gimbal angle. xiii
PAGE 14
AbstractofDissertation Presented to the Graduate Schoolofthe UniversityofFlorida in Partial Fulfillmentofthe Requirements for the DegreeofDoctorofPhilosophy REACHABLE SET CONTROLFORPREFERRED AXIS HOMING MISSILESByDonald J. Caughlin, Jr. April1988Chairman: T.E. Bullock Major Department: Electrical Engineering The applicationofmodern control methodstothe guidance and controlofpreferred axis terminal homing missilesisnon-trivial in that it requires controlling a coupled, non-linear plant with severe control variable constraints, to intercept an evading target. In addition, the rangeofinitial conditionsisquite large andislimited only by the seeker geometry and aerodynamic performanceofthe missile. Thisisthe problem: Linearization will cause plant parameter errors that modify the linear trajectory. In non-trivial trajectories, both Ny andNzacceleration commands will, at some time, exceed the maximum value. The two point boundary problem is too complextocomplete in real time and other formulations are not capableofhandling plant parameter variations and control variable constraints. xiv
PAGE 15
Reachable Set Control directly adapts Linear Quadratic Gaussian (LQG) synthesis to the Preferred Axis missile,aswellasa large classofnonlinear problems where plant uncertainty and control constraints prohibit effectivefixed-final-timelinear control.Itisa robust control technique that controls a continuous system with sampled data and minimizes the effectsofmodeling errors. As a stochastic command generator/tracker, it specifies and maintains a minimum control trajectory to minimize the terminal impactoferrors generated by plant parameter (transfer function)ortarget set uncertainty while rejecting system noise and target set disturbances. Also, Reachable Set Control satisfies the Optimality Principle by insuring that saturated control,ifrequired, will occur during the initial portionofthe trajectory. With large scale dynamics determined by a dual reference in the command generator, the tracker gains can be optimized to the response timeofthe system. This separation results in an "adaptable" controller because gains are based on plant dynamics and cost while the overall systemissmoothly driven from some large displacement to a region where the relatively high gain controller remains linear. xv
PAGE 16
CHAPTER I INTRODUCTION The applicationofmodern control methods to the guidance and controlofpreferred axis terminal homing missiles has had only limited success [l,2,3]. This guidance problemisnon-trivial in thatitrequires controlling a coupled, non-linear plant with severe control variable constraints, to intercept an evading target. In addition, the rangeofinitial conditionsisquite largeandlimited only by the seeker geometry and aerodynamic performanceofthe missile. There are three major control issues that must be addressed: the couplednon-linearplantofthe Preferred Axis Missile; the severe control variable constraints; and implementation in the missile where the solutionisrequired to control trajectories lasting one(I)to two (2) seconds real time. There have been a numberofrecent advances in non-linear controlbutthese techniques have not reached the point where real time implementation in an autonomous missile controllerispractical [4,5,6]. Investigationofnon-linear techniques during this research did not improve the situation. Consequently, primarilydueto limitations imposed by real time implementation, linear suboptimal control schemes were emphasized. Bryson&Ho introduced a numberoftechniques for optimal control with inequality constraints on the control variables [7]. Eachofthese use variational techniques to generate constrained and unconstrained arcs that must be pieced together to construct the optimal trajectory.
PAGE 17
2Ingeneral, real time solutionofoptimal control problems with bounded controlisnot possible [8]. In fact, with the exceptionofspace applications, the optimal control solution has not been applied [9,10]. When Linear Quadratic Gaussian (LQG) techniques are used, the problemisnormally handled via saturated linear control, where the control is calculatedasifno constraints existed and then simply limited. This technique has been shown to be seriously deficient. In this case, neither stability nor controllability can be assured. Also, this technique can cause an otherwise initially controllable trajectory to become uncontrollable [11]. Consequently, a considerable amountoftime is spent adjusting the gainsofthe controller so that control input will remain below its maximum value. This adjustment, however, will force the controller to operate below its maximum capability [12]. Also, in the caseofthe terminal homing missile, the applicationofLQG controllers that do not violate an input constraint lead to an increasing acceleration profile and (terminally) low gain systems [13].Asa result, the performanceofthese controllersisnot desirable. Whileitisalwayspossible to tune a regulator to control the system to a given trajectory, the varianceofthe initial conditions, the time to intercept the target (normally a few seconds for a short range high performance missile), and the lackofa globally optimal trajectory due to the nonlinear nature, the best policyisto develop a suboptimal real time controller. The problemofdesigning a globally stable and controllable high performance guidance system for the preferred axis terminal homing missile is treated in this dissertation. Chapter 2 provides adequate background information on the missile guidance problem. Chapter 3 covers recent work on constrained
PAGE 18
3 control techniques. Chapters 4 and 5 discuss Robust Control and introduce "Reachable Set" Control, while Chapter 6 applies the technique to controlofapreferredaxis homing missile.Theperformanceof"Reachable Set" controlispresented in Chapter7.
PAGE 19
CHAPTER II BACKGROUND The preferred axis orientation missile has significant control input constraints and complicated coupled angular dynamics associated with the maneuvering. In the generic missile considered, the Z axis acceleration (see Figure 2.1) was structurally limited to100"g"with further limits on"g"resulting from a maximum angleofattackasa functionofdynamic pressure. Even though the Z axis was capableof100"g",the "skid-to-turn" capabilityofthe Y axis was constrained to 5"g"or less becauseofaerodynamic limitations a20:1difference. In addition to pitch (Nz) and yaw (Ny) accelerations, the missile can roll upto500 degrees per second to align the primary maneuver plane with the planeofintercept. Hence,bank-to-turn. x zFigure2.1Missile Reference System.4
PAGE 20
5The classical technique for homing missile guidanceisproportional navigation (pro nav). This technique controls the seeker gimbal angle ratetozero which (given constant velocity) causes the missiletofly a straight line trajectory toward the target [14,15]. In the late 70's aneffortwas made to use modern control theory to improve guidance laws forair-to-airmissiles.Forrecent research on this problem see, for example, [11].Asstated in the introduction, these efforts have not significantly improved the performanceofthe preferred axis homing missile.Ofthe modern techniques, two basic methodologies have emerged: one was a body-axis oriented control law that used singular perturbation techniques to uncouple the pitch&roll axis [16,17]. This technique assumed that roll rateisthe fast variable, an assumption that may not be true during the terminal phaseofan intercept. The second techniquewasan inertial point mass formulationthat controls inertial accelerations [18]. The acceleration commands are fixed with respecttothe missile body; but, since these commands can be related to the inertial reference via the Euler Angles, the solution is straight forward. Bothofthese methods have usually assumed unlimited control available and the inertial technique has relied on the autopilottocontrol the missile roll angle, and therefore attitude,tode rotate from the inertial to body axis. Missile Dynamics The actual missile dynamics are a coupled setofnonlinear forces and moments resolved along the (rotating) body axesofthe missile [19]. Linearizationofthe equations about a "steady state"ortrim condition,
PAGE 21
6neglectinghigherorderterms, resultsinthefollowingsetofequations (usingstandardnotation,see symbol keyinthe preface):ex=Q-PB+ Azb / IV totl B =-R-Pex+Ayb/ IVtotlLinearAccelerationsIU=RV-QW-{DO+DOwt }MIV=PW -RU+-{yo+ YOwtl M+ Yf3B+YaB+ YpP+YrR+ Y0 0 + Yoaoa + Yoror IW=QU-PV +-{LO+LOwt} MMomentEquations Q=MO/I yy +M u U+Mexex+Mexex+MqQ+ Moeoe (1)(2)(3) (4)(5) (6)+-----IyyIyy
PAGE 22
7 (7)+(8)+Linear Quadratic Gaussian Control Law For allofthe modern development models, a variationofa fixed-final-time LQG controller was usedtoshape the trajectory. Also, it was expected that the autopilot would realize the commanded acceleration. First, consider the effectofthe unequal body axis constraints. Assume that100"g"was commanded in each axis resulting in an acceleration vector45degrees from Nz.IfNyisonly capableof5"g",the resultant vector will be 42 degrees in error, an error that will havetobe corrected by succeeding guidance commands. Evenifthe missile has the timeorcapability to complete a successful intercept, the trajectory can not be considered optimal. Now consider the nonlinear natureofthe dynamics. The inertial linear systemisaccurately modeledasa double integratorofthe acceleration to determine position. However, the acceleration commandisa functionofthe missile state, equation(1),and therefore, itisnot possible to arbitrarily assign the input acceleration. And, given a body axis linear acceleration, the inertial
PAGE 23
8component will be severely modified by the rotation (especially roll)ofthe reference frame. Allofthese effects are neglectedinthe linearization. This thenisthe problem: In the intercept trajectories worth discussing, Ny, Nz, and roll acceleration commands will,atsome time, saturate. Highorder,linear approximations do not adequately model the effectsofnonlinear dynamics, and the complete two point boundary value problem with control input dynamics and constraintsistoo difficult to complete in real time. Although stochastic models are discussed in Bryson and Ho [7], and a specific techniqueisintroduced by Fiske [18], the general procedure has been to use filtered estimates and adynamic-programming-likedefinitionofoptimality (using the PrincipleofOptimality) with Assumed Certainty Equivalence to find control policies [20,21,22]. Therefore, allofthe controllers actually designed for the preferred axis missiles are deterministic laws cascaded with a Kalman Filter. The baseline forouranalysisisan advanced control law proposed by Fiske [18]. Given the finite dimensional linear system: where andxx(t)=Fx(t)+Gu(t)xyz VxVyVz(9)
PAGE 24
9with the cost functional:Jtfl=xfPfxf+ 1 uTRudrtoR=I(10)Applicationofthe Maximum principle results in a linear optimal control law:=-----3(Tgo) 31 +(Tgo)3+3(Tgo)2 31 +(Tgo)3(11)Coordinates usedforthis system are "relative inertial."Theorientationofthe inertial systemisestablished at the launch point.Thedistances and velocities are the relative measures between the missileandthe target. Consequently, the set pointiszero, with the reference frame moving with the missile similartoa "moving earth" reference used in navigation. Since Fisk's control law was based on a point mass model, the control law did not explicitly control the roll angle PHI (0). Theroll angle was controlledbyabank-to-turnautopilot [23]. Therefore, the guidance problemwasdecomposed into two components, trajectory formationandcontrol. The autopilot attempted to control the rollsothat thepreferredaxis (the-Zaxis) was directed toward the planeofintercept.Theautopilot used to control the missile was designed to use proportional navigationandisa classical combinationofsingle loop systems. Recently, WilliamsandFriedland have developed a newbank-to-turnautopilot based on modern state space methods [24]. Inorderto accurately control the banking maneuver, the missile dynamics are augmented to include the kinematic relations describing the change in the commanded specific force
PAGE 25
10vectorwithbankangle.Todeterminethe actual anglethroughwhichthe vehicle must roll,definethe roll angle error:Ayb e", =tan-l{--} Azb (12) Using thestandardrelationsforthe derivativeofavectorina rotatingreferenceframe,the following relationships followfromthe assumptionthatA11AlB:Azb=-P(Ayb)Ayb=+P(Azb) (13) (14)Theangle e", represents theerrorbetween the actualanddesiredroll angleofthe missile.Differentiating e", yields: (Azb)(Ayb) (Ayb)(Azb) e", =(Azb)2+(AYb)2which,aftersubstituting componentsofAxw, showsthat(15) (16) Simplifying the nonlinear dynamicsof(1) (8), the following model was used:oe=Q-PJ3+Azb / IVtotl 13 =-R-Poe+Ayb /IVtotl(Izz -Ixx ) Q=Moeoe+MqQ+ MSeSe +-----IyyPR(17) (18) (19)
PAGE 26
wherePQ(20)(21)(22)(23)U sing this model directly, the autopilot would be designedasan eighth-order system with time-varying coefficients. However, even though these equations contain bilinear terms involving the roll rate Paswellaspitch/yaw cross-coupling terms, the roll dynamics alone, represent a second order system thatisindependentofpitch and yaw. Therefore, using an "Adiabatic Approximation" where the optimal solutionofthe time-varying systemisapproximated by a sequenceofsolutionsofthe time-invariant algebraic Riccati equation for the optimum control law at each instantoftime, the model was separated into roll and pitch/yaw subsystems [25]. Now, similartoa singular perturbations technique, the functionofthe roll channelisto provide the necessary orientationofthe missilesothat the specific force acceleration lies on the Z (preferred) axisofthe missile. Using this approximation, the systemisassumed to be in steady state, and all coefficients--including rollrate--areassumed to be constant. Linear Quadratic Gaussian (LQG) synthesisisused, with an algebraic Riccati equation, on a second and sixth order system. And, when necessary, the gains are scheduledasa functionofthe flight condition.
PAGE 27
12While still simplified, this formulation differs significantly from previous controllers in two respects. First, the autopilot explicitly controls the roll angle; and second, the pitch and yaw dynamics are coupled. Even though preliminary work with this controller demonstrated improved tracking performance by the autopilot, overall missile performance, measured by miss distance and time to intercept, did not improve. However, the autopilot still relies on a trajectory generated by the baseline controller ( e.g. Azb in 17). Consequently, the missile performance problemisnot in the autopilot, the error sourceisin the linear optimal control law whichforms the trajectory. "Reachable Set Control"isa LQG formulation that can minimize these errors.
PAGE 28
CHAPTER III CONSTRAINED CONTROL In Chapters I and II,wecovered the non-linear plant, the dynamics neglected in the linearization, the impactofcontrol variable constraints, and the inabilityofimproved autopilots to reduce the terminal error. To solve this problem,wemust consider the optimal controlofsystems subject to input constraints. Although a searchofthe constrained control literature did not provide any suitable technique for real time implementation, someofthe underlying concepts were used in the formulationof"Reachable Set Control." This Chapter reviews someofthese results to focus on the constrained control problem and illustrate the concepts. Muchofthe early work was based on research reported by Tufts and Shnidman [26] which justified the useofsaturated linear control. However,asstated in the introduction, with saturated linear control, controllabilityisnot assured.Ifthe system, boundary values and final time are such that thereisno solution with any allowable control(Ifthe trajectoryisnot controllable), then the boundary condition will not be met by either azero terminal errororpenalty function controller. While constrained control can be studied in a classical way by searching for the effectofthe constraint on the valueofthe performance function, this procedureisnot suitable for real time controlofa system with a wide rangeofinitial conditions [27]. Someofthe techniques that could be implemented in real time are outlined below.13
PAGE 29
14Lim used a linearized gain to reduce the problemtoa parameter optimization [8]. Given the system model: x=Fx+Gu+Lw (1) with state x, constant F,G,and L, scaler control u, and Lw representing zero mean Gaussian white noise with covariance LLT. Consider the problemofchoosing a feedback law such that in steady state, assuming it exists, the expected quadratic costItfJ=E{lim [ (x(t)TQx(t)+ >.u(t)2) dt+X(tf)Tp(tf)X(tf)] }tf-00 to (2)isminimized. The weighting matrix Qisassumed to be positive semidefinite and >. O.Dynamic programming leadstoBellman's equation: min { t tr[LTYxx(x)L)+(Fx+Gu)Tyx(x)+xTQx+ >.u2 }= 0:.* and, assuming a Y(x) satisfying(3),the optimal solution u(x)=SAT {(1/2>')G T y x (x) }=SGN {GTyx(x)}(3)(4)However,(3)cannot be solved analytically, andYx in generalisa nonlinear functionofx. Consider a modified problem by assuming a controlofthe form: u(x)=SAT { gT x }=SGN{gTx } where gisa constant (free) vector. >'=0 (5a)(5b)Assume further that xisGaussian with known covarianceW(positive definite). Using statistical linearization, a linearized gain k can be obtained by minimizing E{u(x) -kTx}2(6)
PAGE 30
15which results in for (Sa): for (Sb): where 4>(z) = (2MtJ;xp (-ty 2)dyk= (2/7l")t .(gTWg)-t.g(7a)(7b)From (1), with u determined by kT x, the stable covariance matrixWandsteady statePareand(F+GkT)W+W(F+GkT)T+LLT=0 (F+GkT)Tp+P(F+GkT )+P+ >.kkT =0(8)(9)andThe solution to (3), without the minimum, isv(x)=xTpxex=tr{LT PL } (10)(11) The problemisto choose g suchthatthe expected costexby statistical linearizationisa minimum. However, a minimum may not exist. In fact, from [8], a minimum does not exist when the noise disturbance is large. Since we are considering robust control problems with plant uncertainty or significant modeling errors, the noise will be large and the minimum will be replaced by a greatest lower bound.Asexapproached the greatest lower bound, the control approached bang-bang operation. A combinationofplant errorsandthe rapid dynamicsofsome systems (such as the preferred axis missile) would preclude acceptable performance withbang-bangcontrol.
PAGE 31
16Frankenaand Sivan suggested a criterionthatreduce thetwo-pointboundary problem to an initial value problem [12]. They suggest controlling the plant while minimizing this performance index: With the constraint lIu(t)IIR(t) 1 Applying the maximum principle to the Hamiltonian developed from(12)x(t)=F(t)x(t)+G(t)u(t) with x(to)=xOprovides the adjoint differential equationto
PAGE 32
17ForGT Px:;: 0 andIIxliS :;: 0, P will be the solutionofPGR-IGTpS P+PF+=Q+---FTPIIR I GTpxIlRIIxliS(18)Now choosing S=PGR-IGTpresults in a Lyapunovequationandwill insure negativedefiniteP(t)ifFisa stability matrix.Therefore,withthis choiceofweighting functions totransformthe problem to a singleboundarycondition, a stable F matrixisrequired. Thisisa significant restrictionandnot applicable to the systemunderconsideration.Gutmanand Hagander developed a designforsaturatedlinear controllersforsystems with controlconstraints[9,28].Thedesign beginswitha low-gain stabilizing control, solves a Lyapunov equation tofinda regionofstability and associated stability matrix, andthensums the controlsina saturation function toformthe constrained control. Begin with the stabilizable continuous linear time invariant system x=Fx+Guwith admissible controlinputsui,suchthatx(O)=xO=l,...,m(19)where gi and hi are the control constraints. ConsiderannxmmatrixL == [11I12I...I1 m ] suchthatisa stability matrix.(20)(21)
PAGE 33
18Associated with eachofthe controlsaresetsthatdefineallowable conditions.Theset Disthe setofinitial conditionsfromwhichitisdesired to stabilize the system to the origin.Thelow gain stabilizing control LdefinesthesetE:E == E(L) == (zIzER(22)i=I,...,mwhichis the setofstatesatwhichthestabilizing linearfeedbackdoesnotinitiallyexceedthe constraints.Anotherset is F: F == F(L) ==n {(eFct)-1E}tE[O,oo)(23)(24) Fisa subsetofE suchthatalong all trajectories emanatingfromF,thestabilizing linear state feedback doesnotexceed the constraint.Theregionofstabilityforthe solutionofthe Lyapunovequationisdefinedby 0== O(L,P,c) == {xIxTPx c}whereV(x)=xTpxis the Lyapunovfunctioncandidateforthe stabilitymatrixFc,andc is to be determined.Thecontrol technique follows: Step1:DetermineD. Step2:FindLbysolving aLQGproblem.Thecontrol penalty is increaseduntilthecontrolLTx satisfies theconstraintin(19)forxinD.IfDissuchthatthecontrolconstraintcan notbesatisfied,thenthis designisnotappropriate.Step3:FindPandc.FirstfindaP=PT>0suchthattheLyapunovequationPFc+F cTp>O.Now determine 0 bychoosing cin(24) suchthatD 0 E:supxT Px c minxT PxxEDxE&E(25)
PAGE 34
19where 6E designates the boundaryofE.Ifthis fails, choose another P, or select a "lower gain" in order to enlarge E,orfinally, a reduction in the sizeofD might be considered. Step4:Set up the control according to u= SAT[ (L T -KGTp)x]where Kisdefined(26)K=[ki0]ki 0,i= 1,2, ... ,m (27)okmand tune the parameters ki by simulations. A sufficient condition for the algorithm to workisD E.(28)Unfortunately, determining the stability region was trial and error; and, once found, further tuningofa diagonal gain matrixisrequired. In essence, thiswasa technique for determining a switching surface between a saturated and linear control. Also, when the techniquewasapplied to an actual problem, inadequacies in the linear model were not compensated for. Given the nonlinear natureofthe preferred axis missile, rangeofinitial conditions, and the trial and error tuning required for eachofthese conditions, the procedure would not be adequate for preferred axis terminal homing missile control. A notable featureofthe control scheme, however, was the ability to maintain a stable system with a saturated control during muchofthe initial portionofthe trajectory. Another technique for control with bounded input was proposed by Spong et al. [29]. This procedure used an optimal decision strategytodevelop a pointwise optimal control that minimized the deviation between the actual and
PAGE 35
20desired vectorofjointaccelerations, subject toinputconstraints. The computationofthe control law is reduced to the solutionofa weighted quadratic programming problem. Key to this solution is the availabilityofa desired trajectory in state space. Suppose that a dynamical system can be described by with which can be writtenasx(t)=f(x(t+G(x(tu(t) c (29) Fix time t 0, let s(t,xo,to,u(t (or s(t) for short), denote the solution to (29) corresponding to the giveninputfunction u(t). At time t,ds/dtisthe velocity vectorofthe system, andisgiven explicitly by the right hand sideof(29). Define the set C t=C(s(t with C(s(t=(ex(t,w)ERNIex=f(s(t+G(s(tw,wE {} } {} ={wI NW:5 C}(30)Therefore, for each t andany allowable u(t),ds/dtliesinthe set Ct. In other words, the set C t contains the allowable velocitiesofthe solution s(t). Assume that there exists a desired trajectory yd, and an associated vector field v( t)=v(s(t),yd(t),t, which is the desired (state) velocityofs(t) to attain yd. Consider the following "optimal decision strategy" for a given positive definite matrixQ:Choose theinputu(t)sothat the corresponding solution s(t) satisfies(d/dt)s(t,u(t=s*(t), where s*(t)ischosenateach t to minimize min {(ex v(s(t),yd(t),tTQ(exv(s(t),yd(t),t }exECt(31)
PAGE 36
21Thisisequivalenttothe minimization min { !UTGTQGu -(GTQ(v-fTu } subject to, Nu(t) cu(32)Wemay now solve the quadratic programming problemtoyield a pointwise optimal control law for (29). At each time t, the optimal decision strategy attemptsto"align" the closed loop system with the desired velocity v(t)asnearlyaspossible in a least squares sense. In this way the authors retain the desirable propertiesofv(t) within the constraints imposed by the control. Reachable Set Control builds on this technique:itwill determine the desired trajectory and optimally track it. Finally, minimum-time control to the origin using a constrained acceleration has also been solved by a transformationtoa two-dimensionalun-constrained control problem[30].Byusing a trigonometric transformation, the controlisdefined by an angular variable, u(t) f{cos(I3),sin(I3)}, and the control problem was modifiedtothe controlofthis angle. The constrained linear problemisconvertedtoan unconstrained nonlinear problem that forces a numerical solution. This approach removes the effectofthe constraintsatthe expenseofthe continuous applicationofthe maximum control. Given the aerodynamic performance (range and velocity) penaltyofmaximumcontrol and the impact on attainable roll rates due to reduced stability at high angleofattack, this concept did not fit preferred axis homing missiles.Animportant assumption in the previous techniques was that the constrained systemwascontrollable. In fact, unlike (unconstrained) linear systems, controllability becomes a functionofthe set admissible controls, the initial state, the time-to-go, and the target state. To illustrate this, someofthe relevant points from [31,32] will be presented. An admissible controlisone
PAGE 37
22 that satisfies the condition u() :[0,00) 0 ERm where 0 is the control restraint set. The collectionofall admissible controls will be denoted by M(O). The target set Xisa specified subset in Rn.Asystem is defined to be O-controllable from an initial statex(tO)= xo to the target set X at Tifthere existsU()EM(O)such that x(T,u(),xO) EX.Asystem would be globally O-controllable to XifitisO-controllable toXfromeveryx(tO)ERn.Inordertopresent the necessary and sufficient conditions for O-controllability, consider the following system: x(t)=F(t)x(t)+G(t,u(tand the adjoint defined by: =F(t)T z(t) with the state transition matrix q>(t,r) and solution x(to) =xOZ(to) =zO(33)t f [0,00)(34)(35)The interior B and surface Softhe unit ball in Rn are defined asB={ZO f RnS={zOfRn llzoll I}IIzoll=I}(36) (37)The scaler functionJ(.):Rn xRxRnxRn-R is defined by J(xo,t,x,zo) =XOTZO+Jtmax [GT(r,w)z(r)]dr-x(t)Tz(t)o WfO (38)Given the relatively mild assumptionsof[32], a necessary conditionfor(33) to be O-controllable to X fromx(tO)ismax min J(xO,T,x,ZO) =0 XfXzOfB while a sufficient conditionissup min J(xO,T,x,zO)>0 XfXZOfS (39) (40)
PAGE 38
23The principle behind the conditions arises from the definitionofthe adjoint system--Z(t). Using reciprocity, the adjointisformed by reversing the roleofthe input and output, and running the system in reverse time [33]. Consider x(t)=F(t)x(t)+G(t)u(t) y(t)=H(t)x(t) and: z(t)=-F(t)Tz(t)+ o(t)=GT(t)z(t) Therefore x(to)=xOz(to)=zO(41) (42)and zT(t)x(t)=zT(F(t)x(t)+G(t)u(t(d/dt)(zT(t)x(t= T(t)x(t)+z T(t)x'(t)= +zT(t)G(t)u(t)(43) (44)Integrating both sides from tototfyields the adjoint lemma:(45)The adjoint defined in (31) does not have an input. Consequently, the integral in (35)isa measureofthe effectofthe control applied to the original system.Bysearching for the maximumGT(r,w)z(r),it provides the boundaryoftheeffectofallowable control on the system (33). Restricting the search over the target settothe min ( J(xo,t,x,zO) :t f [O,T],zo f S}ormin ( J(xo,t,x,zo) :t f [O,T],zo f B} minimizes the effectofthe specific selectionof Z() on the reachable set and insures that the search is over a function thatisjointly continuous in (t,x). Consequently, (35) compares the autonomous growthofthe system, the reachable boundaryofthe allowable input, and the desired target set and time. Therefore,ifJ=0, the adjoint lemmaisbe
PAGE 39
24identically satisfied at the boundaryofthe control constraint set (necessary); J>0 guarantees that a control can be found to satisfy the lemma.Ifthe lemmaissatisfied, then the initial and final conditions are connected by an allowable trajectory. The authors [32]goon to develop a zero terminal error steering control for conditions where the target setisclosed and max min J(xO,T,x,zO) 0 xXZOS (46)But their control technique has two shortcomings: First; it requires the selectionofz00The initial conditionzoisnot specified but limited toIIz01l=1.A particularzomust be selected to meet the prescribed conditions and the equality in (43 ) for a given boundary condition, andistherefore not suitable for real time applications. And second; the steering control searched M(O) for the supremumofJ, making the control laws bang-bang in nature, again not suitable for homing missile control. While a direct searchof Ox isnot appropriate for a preferred axis missile steering control, a "dual" system, similar to the adjoint system used in the formulationofthe controllability function J, can be usedtodetermine the amountofcontrol requiredtomaintain controllability. Once controllabilityisassured, then a cost function that penalizes the state deviation (as opposed to a zero terminal error controller) can be used to control the systemtoan arbitrarily small distance from the reference.
PAGE 40
CHAPTERIV CONSTRAINEDCONTROLWITHUNMODELEDSETPOINTANDPLANTVARIAnONSChapterIII reviewed anumberoftechniques tocontrolsystemssubjectto control variable constraints. While noneofthe techniques werejudgedadequateforreal time implementationofapreferred-axishoming missile controller, someofthe underlying conceptscanbe used to develop atechniquethatcanfunctionin the presenceofcontrol constraints:(l)Useofa "dual system"thatcanbe used to maintain a controllable system (trajectory); (2) an "optimal decision strategy" to minimize the deviation betweentheactualanddesiredtrajectorygenerated by the "dual system;"and(3) initiallysaturatedcontrolandoptimal (realtime) selectionofthe switchingsurfacetolinearcontrolwithzero terminalerror.However, inadditionto,andcompounding the limitations imposedbycontrol constraints, wemustalso consider the sensitivityofthecontrol to unmodeled disturbancesandrobustnessunderplant variations.Inthe stochastic problem, there are threemajorsourcesofplantvariations.First,therewill be modeling errors (linearization/reductions)thatwill causethedynamicsofthe system to evolve in adifferentor"perturbed" fashion. Second,theremay betheunmodeleduncertaintyinthe system statedueto Gaussian assumptions.Andfinally,inthe fixed final time problem, there may beerrorsinthe final time, especiallyifitisafunctionoftheuncertain stateorimpactedbythe modeling reductions. Since theprimaryobjectiveofthis researchisthe zeroerrorcontrolofa dynamical systeminfixed time, mostofthemore recent 25
PAGE 41
26optimization techniques (eg. LQG/LTR,WXl) did not apply. At this time, these techniques seemedtobe more attuned to loop shapingorrobust stabilization questions. A fundamental proposition that forms the basisofReachable Set Controlisthat excessive terminal errors encountered when using an optimal feedback control for an initially controllable trajectory (a controllable system that can meet the boundary conditions with allowable control values) are caused by the combinationofcontrol constraints and uncertainty (errors) in the target set stemming from unmodeled plant perturbations (modeling errors)orset point dynamics. First, a distinction must be made between a feedback and closed-loop controller. Feedback controlisdefinedasa control system with real-time measurement data fed back from the actual system but no knowledgeofthe form, precision, or even the existenceoffuture measurements. Closed-loop control exploits the knowledge that the loop will remain closed throughout the future intervaltothe final time.Itaddstothe information provided to a feedback controller, anticipates that measurements will be taken in the future, and allows prior assessmentofthe impactoffuture measurements.IfCertainty Equivalence applies, the feedback lawisa closed-loop law.Underthe Linear Quadratic Gaussian (LQG) assumptions, thereisnothing to be gained by anticipating future measurements. In the mathematical optimization, external disturbances can be rejected, and the mean valueofthe terminal error can be made arbitrarily close to zero by a suitable choiceofcontrol cost. For the following discussion, the "system" consistsofa controllable plant and an uncontrollable referenceortarget. The system stateisthe relative difference between the plant state and reference. Since changes in the system
PAGE 42
27boundary condition can be caused by either a change in the reference point or plant output perturbations similartothose discussed in Chapter II, some definitions are necessary. The setofboundary conditions for the combined plant and target system, allowing for unmodeled plant and reference perturbations, will be referredtoasthe target set. Changes, or potential for change, in the target set caused only by target (reference) dynamics will be referredtoasvariations in the set point. The magnitudeofthese changesisassumed to be bounded. Admissible plant controls are restrictedtoa control restraint set that limits the input vector. Since there are bounds on the input control, the system becomes non-linear in nature, and each trajectory must be evaluated for controllability. Assume that the system (trajectory)ispointwise controllable from the initialtothe boundary condition. Before characterizing the effectsofplant and set point variations, we must consider the formofthe plant and it's perturbations.Ifweassume that the plantisnonlinear and time-varying, thereisnot much that can be deduced about the target set perturbations. However,ifhave a reduced order linear modelofa combined linear and nonlinear process,ora reasonable linearizationofa nonlinear model, then the plant can be consideredaslinear and time-varying. For example, in the caseofa Euclidean trajectory. the system model (a double integrator)isexact and linear. Usually, neglected higher order or nonlinear dynamics or constraints modify the accelerations and lead to trajectory (plant) perturbations. Consequently, in this case, the plant can be accurately representedasa Linear Time Invariant System with (possibly) time varying perturbations.
PAGE 43
28Consider the feedback interconnectionofthe systems K and P where Kisa sampled-data dynamic controller and P the (continuous) controlled system:r-K uG--p_ .: Figure4.1Feedback System and Notation Assuming that the feedback systemiswell defined and Bounded Input Bounded Output (BIBO) stable, at any sample time ti, the system can be defined in termsofthe following functions: e(ti)=r(ti) y(ti) u( ti)=Ke( ti)y(ti)=Pu( ti)(1) (2)(3)
PAGE 44
with the operator G29G[K,P]asthe operator that maps the input e(ti)tothe output y(ti) [34]. At any time, the effectofa plant perturbation DoP can alsobecharacterizedasa perturbation in the target set. or thenIfP=Po + DoP P= P(I+DoP) y(ti)= YO(tj) + Doy(ti) (4a)(4b)(5)where Doy(ti) represents the deviation from the "nominal" output caused by either the additive or multiplicative plant perturbation. Therefore, e(ti)=r(ti) (YO(ti)+ Doy(ti = (r(tj) + Doy(ti -YO(ti)= Dor(ti) -YO(ti)(6)(7)(8)with Dor(ti) representing a change in the target set that was unknown to the controller. These changes are then fed backtothe controller but could be handled a priori in a closed loop controller designastarget set uncertainty. Now consider theeffectofconstraints.Ifthe controlisnot constrained, and target set errors are generated by plant variationsortarget maneuvers, the feedback controller can recover from these intermediate target set errors by using large (impulsive) terminal controls. The modeled problem remains linear. While the trajectoryisnot the optimal closed-loop trajectory, the trajectoryisoptimal based on the model and information set available. Even with unmodeled control variable constraints, and a significant dis placementofthe initial condition, an exact plant model allows the linear stochastic optimal controllertogenerate an optimal trajectory. The switching time from saturated to linear controlisproperly (automatically) determined and,
PAGE 45
30asin the linear case, the resulting linear control will drive the statetowithin an arbitrarily small distance from the estimateofthe boundary condition.Ifthecontrol constraint set covers the rangeofinputs required by the control law, the law will always be able to accommodate target set errors in the remaining time-to-go. This is, in effect, the unconstrained case. If, however, the cost-to-goishigherand/orthe deviation from the boundary conditionisofsufficient magnitude relative to the time remainingtorequire inputs outside the boundaryofthe control constraint set, the system will not follow the trajectory assumed by the system model.Ifthisisthe caseastime-to-goapproaches zero, the boundary condition will not be met, the systemisnot controllable (to the boundary condition).Astime-to-go decreases, the effectsofthe constraints become more important. With control input constraints, and intermediate target set errors caused by unmodeled target maneuvers or plant variations, it may not be possible for the linear control lawtorecover from the midcourse errors by relying on large terminal control. In this case, an optimal trajectoryisnot generated by the feedback controller, and, at the final time, the systemisleft with large terminal errors. Consequently,ifexternal disturbances are adequately modeled, terminal errors that are ordersofmagnitude larger than predicted by the open loop optimal control are caused by the combinationofcontrol constraints and target set uncertainty.
PAGE 46
31Linear Optimal Control with Uncertainty and Constraints An optimal solution must meet the boundary conditions. To accomplish this, plant perturbations and constraints must be considered a priori. They should be includedasa priori information in the system model, they must be physically realizable, and they must be deterministic functionsofa priori information, past controls, current measurements, and the accuracyoffuture measurements. From the control pointofview,wehave seen that the effectofplant parameter errors and set point dynamics can be groupedastarget set uncertainty. This uncertainty can cause a terminally increasing acceleration profile even when an optimal feedback control calls for a decreasing input (see Chapter 5). With the increasing acceleration caused by midcourse target set uncertainty, the most significant terminal limitation becomes the control input constraints. (These constraints not only affect controllability, they also limit how quickly the system can recover from errors.)Ifthe initial controlissaturated while the terminal portion linear, the controlisstill optimal.Ifthe final controlisgoingtobesaturated, however, the controller must account for this saturation. The controller could anticipate the saturation and correct the linear portionofthe trajectorytomeet the final boundary condition. This control, however, requires a closed formsolution for x(t), carries an increased cost for an unrealized constraint, and is known to be valid for monotonic ( single switching time) trajectories only [11]. Another technique availableisLQG synthesis. However, LQG assumes controllability in minimizing a quadratic cost to balance thecontrol error and
PAGE 47
32input magnitudes.Aswehave seen, the effectsofplant parameter and reference variations, combined with control variable constraints, can adversely impact controllability. The challengeofLQGisthe proper formulationofthe problem to function with control variable constraints while compensating for unmodeled set point and plant variations. Reachable Set Control uses LQG synthesis and overcomes the limitationsofan anticipative controltoinsure a controllable trajectory. Control Technique Reachable Set Control can be thoughtofasa fundamentally different robust control technique based on the concepts outlined above. The usual discussionofrobust feedback control (stabilization) centers on the developmentofcontrollers that function even in the presenceofplant variations. Using either a frequency domain or state space approach, and modeling the uncertain ty, bounds on the allowable plant or perturbations are developed that guarantee stability [35]. These bounds are determined for the specific plant under consideration and a controller is designedsothat expected plant variations are contained within the stability bounds. Building on ideas presented above, however, this same problem can be approached in an entirely different way. This new approach begins with the same assumptionsasstandard techniques, specifically a controllable system and trajectory. But, with Reachable Set Control,wewill not attempt to model the plant or parameter uncertainty, nor the set point variation.Wewill, instead, reformulate the problemsothat the system remains controllable, and thus stable, throughout the trajectory even in the presenceofplant perturbations and severe control input constraints.
PAGE 48
33Before we developanimplementable technique, considerthedesired resultofReachable Set Control (and the originofthe name) by using atwo-dimensional missile intercept problemasan example.Attime t= t}. not any specific time during the intercept, the targetisatsome location T1andthe missileisatMIasshown in Figure 4.2. Consider these locationsasoriginsoftwo independent, target and missile centered, reference systems.Fromthese initial locations, given the control inputs available, reachable sets for each system can be definedasafunctionoftime (not shown explicitly).Thetarget setiscircular because is maneuver directionisunknownbutits capabilitybounded,and the missile reachable set exponential because the x axis controlisconstant and uncontrollable while the z axis accelerationissymmetricandbounded.TheobjectiveofReachable Set Controlisto maintain the reachable targetsetin the interiorofthe missile reachable set. Hence, Reachable Set Control.xTargetReachableM1zMissileReachableSetFigure 4.2 Reachable Set Control Objective
PAGE 49
34As stated, Reachable Set Control would bedifficultto implementasa control strategy. Fortunately, however,furtheranalysis leads to a simple, direct, and optimal technique thatisvoidofcomplicated algorithms orad-hocprocedures. First, consider the process. The problem addressed is the controloffixed-terminal-timesystems. The true cost is the displacementofthe state at the final time and only at the final time. In the terminal homing missile problem, thisisthe closest approach, or miss distance. In another problem, it may be fuel remaining at the final time, or possibly a combinationofthe two. In essence, with respect to the direct applicationofthis technique, thereisno preferenceforone trajectory over another or no intermediate cost based on the displacementofthe state from the boundary condition. The term "direct application" was used because constrained path trajectories, suchasthose required by robotics, or the infinite horizon problem, like the controlofthe depthofa submarine can be addressed by separating the problem into several distinctintervals--eachwith a fixed terminaltime--ora switching surface when the initial objective is met [36]. Given a plant with dynamics x(t)=f(x,t)+g(u(w),t) y(t)=h(x(t),t) modeled by x(t)=F(t)x(t)+G(t)u(t) yx(t)=H(t)x(t) x(to)=xo(9) (10)
PAGE 50
35with final condition x(tf) and a compact control restraint setOx.Let Ox denote the setofcontrols u(t) for which u(t)E Ox for tE [0,00).The reachable set X(tO,tf,xO'Ox) == (x:x(tf)=solutionto(10) withxofor some u()EM(Ox) }isthe setofall states reachable fromxoin time tf. In addition to the plant and model in(9&10),wedefine the reference(11)r(t)=a(x,t)+b(a(w),t) y(t)=c(x(t),t) modeled by r(t)=A(t)r(t)+B(t)a(t)Yr(t)=C(t)r(t) and similarly defined set R(to,tf,ro,Or), r(to)=rO(12) (13) R(to,tf,ro,Or) == (r:r(tf)=solution to (13) withrOfor some a()EM(Or) }asthe setofall reference states reachable fromrOin time tf. (14) Associated with the plant and reference, at every time t,isthe following system: e(t)=yx(t) -Yr(t)(15) ;;m (10&13), we see that yx(t) andYr(t)are output functions that incorporate the significant characteristicsofthe plant and reference that will be controlled. The design objectiveise(tf)=0(16)and we want to maximize the probabilityofsuccess and minimize the effectoferrors generated by the deviationofthe reference and plant from their associated models. To accomplish this with a sampled-data feedback control law,
PAGE 51
36wewill select the control u(ti) such that, at the next sample time (ti+l), the target reachable set will be covered by the plant reachable set and, in steady state,ife(tf)=0, the control will not change. Discussion Recalling that the performance objective at the final timeisthe real measureofeffectiveness, and assuming that the terminal performanceisdirectly related to target set uncertainty, this uncertainty should be reduced with time-to-go. Now consider the trajectory remembering that the plant model is approximate (linearized or reduced order), and that the reference has the capability to change and possibly counter the control input. (This maneuverabili ty does not havetobe taken in the contextofa differential game.Itis only intended to allow for unknown set point dynamics.) During the initial portionofthe trajectory, the target set uncertaintyisthe highest. First, at this point, the unknown (future) reference changes have the capabilityofthe largest displacement. Second, the plant distance from the uncertain set pointisthe greatest and errors in the plant model will generate the largest target set errors becauseofthe autonomous response and the magnitudeofthe control inputs required to move the plant state to the set point. Along the trajectory, the contributionofthe target (reference) maneuvera bility to set point uncertainty will diminish with time. This statement assumes that the target (reference) capability to change does not increase faster than the appropriate integralofits' input variable. Regardlessofthe initial maneu verabilityofthe target, the time remainingisdecreasing, and consequently, the
PAGE 52
37ability to move the set point decreases. Target motionissmaller and it's positionismore and more certain. Selectionofthe control inputs in the initial stagesofthe trajectory that will result in a steady state control (that contains the target reachable set within the plant reachable set) reduces target set uncertainty by establishing the plant operating point and defining the effective plant transfer function. At this point,wedo not have a control procedure, only the motivation to keep the target set within the reachable setofthe plant along with a desire to attain steady state performance during the initial stagesofthe trajectory. The specific objectives are to minimize target set uncertainty, and most importantly, to maintain a controllable trajectory. The overall objectiveisbetter performance in termsofterminal errors. Procedure A workable control law that meets the objectives can be deduced from Figure 4.3. Herewehave the same reachable setforthe uncertain target, but this time, several missile origins are placed at the extremesoftarget motion. From these origins, the systemisrun backward from the final timetothe current time using control values from the boundaryofthe control constraint set to provide a unique setofstates that are controllable to the specific origin.Ifthe intersectionofthese setsisnon-empty, any potential target locationisreachable from this intersection. Figure 4.4 is similar, but this time the missile control restraint setisnot symmetric. Figure 4.4 shows a case where the missile acceleration controlisconstrainedtothe set A=[Amin,Amax] where 0 Amax(17)
PAGE 53
z38xTargetReachable AlltargetpositionsReachableFigure 4.3 IntersectionofMissile Reachable Sets BasedonUncertain Target Motion and Symmetric ConstraintsxTargetReachableSetAllTargetLocationsReachableFigure 4.4 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and Unsymmetric Constraints
PAGE 54
39Since controllabilityisassumed, which for constrained control includes the control bounds and the time interval, the extreme left and right (near and far) pointsofthe set point are included in the set drawn from the origin. To implement the technique, construct a dual system that incorporates functional constraints, uncontrollable modes, and uses a suitable control value from the control constraint setasthe input. From the highest probability target positionatthe final time, run the dual system backward in time from the final boundary condition. Regulate the plant (system) to the trajectory defined by the dual system. In this way, the fixed-final-time zero terminal error controlisaccomplished by re-formulating the problemasoptimal regulationtothe dual trajectory. In general, potential structuresofthe constraint set preclude a specific point (origin, center, etc.) from always being the proper inputtothe dual system. Regulation to a "dual" trajectory from the current target position will insure that the originofthe target reachable set remains within the reachable setofthe plant. Selectionofa suitable interior point from the control restraint setasinputto the dual system will insure that the plant has sufficient control power to prevent the target reachable set from escaping from the interiorofthe plant reachable set. Based on unmodeled set point uncertainty, symmetric control constraints, and a double integrator for the plant, a locus exists that will keep the target in the centerofthe missile reachable set.Ifthe set pointisnot changed, this trajectory can be maintained without additional inputs. For a symmetric control restraint set, especiallyasthe time-to-go approaches zero, Reachable Set Controliscontrol to a "coasting" (null control) trajectory.
PAGE 55
40Ifthe control constraints are not symmetric, such asFigure4.4, a locusofpoints that maintains the target in thecenterofthe reachable set is the trajectory based on the systemrunbackward from the final time target location with the acceleration command equal to the midpointofthe set A. Pictured in Figures 4.2 to 4.4 were trajectories that are representativeofthe double integrator. Other plant models would havedifferenttrajectories. Reachable Set Control is a simple techniqueforminimizing the effectsoftarget set uncertainty and improving terminal the performanceofa large classofsystems.Wecan minimize the effectsofmodeling errors (or target setuncertainty) by a linear optimal regulator that controls the system to a steady state control. Given the well known and desirable characteristicsofLQGsynthesis, this technique canbeusedasthe basisforcontrol to the desired "steady state control" trajectory.Thetechnique handles constraints by insuringaninitially constrained trajectory. Also, since the large scale dynamics are controlled by the "dual" reference trajectory, the tracking problem be optimized to the response timeofthe systemunderconsideration. This results inan"adaptable" controller because gains are basedonplant dynamicsandcost while the overall system is smoothlydrivenfrom some large displacement to a region where the relatively high gainLQGcontroller will remain linear.
PAGE 56
CHAPTERV REACHABLE SETCONTROLEXAMPLEPerformance Comparison Reachable SetandLOGControlInorderto demonstrate theperformanceof"Reachable Set Control" we will contrast its performance with theperformanceofa linear optimal controllerwhenthereistarget setuncertaintycombinedwithinputconstraints. Consider,forexample, thefinitedimensional linear system:withthequadratic cost x(to)=xo(1)whereJ1=-XfTpfXf 2(2)andtfE [0,00) '1 0Applicationofmaximum principle yields the following linear optimal control law:where1u=+-x(tf)(t-tf) '1 xo+xo tfx(tf)=1+41(3) (4)
PAGE 57
42Appropriately defining t, to, andtf,the control law can be equivalently expressed in an open loop or feedback form with the latter incorporating the usual disturbance rejection properties. The optimal control will tradeoff the costofthe integrated square input with the final error penalty. Consequently, even in the absenceofconstraints, the terminal performanceofthecontrolisa functionofthe initial displacement, time allowed to drive the statetozero, and the weighting factor ,. To illustrate this, Figure5.1presents the terminal states (miss distance and velocity)ofthe linear optimal controller. This plotisa compositeoftrajectories with different run times ranging from 0to3.0 seconds. The figure presents the valuesofposition and velocity at the final time t=tfthat result from an initial positionof1000 feet and with velocityof1000feet/sec with =104 Figure 5.2 depicts,asa functionofthe run time, the initial acceleration (at t=0.) associated with eachofthe trajectories shown in Figure 5.1. From these two plots, the impactofshort run timesisevident: the miss distance will be higher, and the initial acceleration command will be greater. Since future set point (target) motionisunknown, the suboptimal feedback controllerisreset at each sample time to accommodate this motion. The word resetissignificant. The optimal controlisa functionofthe initial condition at time t=to, time, and the final time. A feedback realization becomes a functionofthe initial condition and time to go only. In this case, set point motion (target set uncertainty) can place the controller in a position where the time-to-goissmall but the state deviationislarge.
PAGE 58
Velocity2000o-2000-4000 -6000 -8000-10000 -12000-14000o20043400600800Position10001200Figure5.1Terminal PerformanceofLinear Optimal ControlAccelerationo-100000--200000-300000v-400000o0.51Final Time1.52Figure 5.2 Initial AccelerationofLinear Optimal Control
PAGE 59
44While short control times will result in poorer performance and higher accelerations, it does not take a long run time to drive the terminal error to near zero. Also, from (4)wesee that the terminal error can be driventoan arbitrarily small value by selectionofthe control weighting. Figure5.1presented the final valuesoftrajectories running from 0 to 3 seconds. Figures 5.3 through 5.5 are plotsofthe trajectory parameters for the two second trajectory (with the same initial conditions) along with the zero control trajectory values. These values are determined by starting at the boundary conditionsofthe optimal control trajectory and running the system backward with zero acceleration. For example,ifwestart at the final velocity and run backwards in time along the optimal trajectory, for each point in time, thereisa velocity (the null control velocity) that will take the corresponding positionofthe optimal control trajectorytothe boundary without additional input. The null control position begins at the origin at the final time, and moving backward in time,isthe position that will take the system to the boundary condition at the current velocity. Therefore, these are the positions and velocities (respectively) that will result in the boundary condition without additional input.Ast=>tfthe optimal trajectory acceleration approaches zero. Therefore, the zero control trajectory converges to the linear optimal trajectory.Ifthe system has a symmetric control constraint set, Reachable Set Control will control the system position to the zero control (constant velocity) trajectory.
PAGE 60
Accelerationo-500-1000-1500 -2000-250000.5451Time1 .52Figure 5.3 Linear Optimal Acceleration vs Time r--Null Control VelocityVelocity1500 10005000-500........-1000 -15000.50..............1Time1.52Figure 5.4 Linear Optimal Velocity vs Time
PAGE 61
46Position20001000......o-1000-2000-3000o NullControlPosition0.511.5TimeFigure 5.5 Linear Optimal PositionvsTime2Consider now the same problemwithinputconstraints. Since Vet)is a linear functionoftime and the final state,itismonotonicandthe constrained optimal controlisIu=SAT(-x(tf)(t-tf(5) "I In this case, controllability is in question,andisa functionofthe initial conditionsandthetime-to-go.Assuming controllability, the final state will be given by: x(tf)=xQ+ xQtf a(tl)SGN(x(tf)[tf-(tl/2)](tf-t1)31+---(6)
PAGE 62
47wheretlisthe switching time from saturated to linear control. The open loop switch time can be shown to be(7)or the closed loop control can be used directly. In either case, the optimal control will correctly control the system to a final state X(tf) near zero. Figures 5.6 through 5.8 illustrate the impactofthe constraint on the closed loop optimal control. In each plot, the optimal constrained and unconstrained trajectoryisshown.Acceleration1000o-1000-2000-3000 -4000.............-500000.51Time1 .52Figure 5.6 Unconstrained and Constrained Acceleration
PAGE 63
48Velocity2000o-2000-4000Constrained o0.51Time1 .52Figure 5.7 Unconstrained and Constrained VelocityvsTime ...... ...........21 .51Time0.5Constrained Position------l Position250020001500 10005000 0Figure 5.8 Unconstrained and Constrained Position vs Time
PAGE 64
49Now consider the effectsoftarget set uncertainty on the deterministic optimal control by using the same control law for a 2.0 second trajectory where the boundary conditionisnot constantbutchanges. The reason for the target uncertainty and selectionofthe boundary condition can be seen by analyzing the componentsofthe modeled system. Assume that system actually consistsofan uncontrollable reference (target) plant as wellascontrolled (missile) plant with the geometry modeled by the difference in their states. Therefore, the final set point (relative distance)iszero,butthe boundary condition along the controlled (missile) trajectoryisthe predicted target position at the final time. This predicted position at the final timeisthe boundary condition for the controlled plant. Figures 5.9 through5.11are plotsoflinear optimal trajectories using the control law in (5,6). There are two trajectoriesineach plot. The boundary condition for one trajectoryisfixed at zero, the set point for the other trajectory is the pointwise zero control value (predicted target state at the final time). Figures 5.9 through5.11demonstrate the impactofthis uncertainty on the linear optimal control law by comparing the uncertain constrained control with the constrained control that has a constant boundary condition.
PAGE 65
Acceleration1000o-1000 -2000 -3000 -400050......UncertainTarget Set -500000.51Time1 .52Velocity2000o-2000 -4000Figure 5.9 Acceleration ProfileWithand Without Target Set Uncertainty TargetSet \ .......................o0.51Time1 .52Figure 5.10 Velocity vs TimeWithand Without Target Set Uncertainty
PAGE 66
51PositionUncertainTargetSet ---------1400 1200 1000800600400200o00.51Time1 .52Figure5.11PositionvsTime With and Without Target Set Uncertainty When thereistarget set uncertainty, simulated by the varying set point, the initial accelerationisinsufficient to prevent saturation during the terminal phase. Consequently, the boundary conditionisnot met. The final setofplots, Figures 5.12 through 5.14, contrast the performanceofthe optimal LQG closed loop controller that we have been discussing and the Reachable Set Control technique.Inthese trajectories, the final set pointiszerobutthereistarget set uncertainty again simulatedbya time varying boundary condition (predicted target position) that convergestozero. Although properly shownasa fixed final time controller, the Reachable Set Control results in Figures 5.12 through 5.14 are from a simple steady state (fixed gain) optimal tracker referencedtothe zero control trajectory r.
PAGE 67
52Thesystem modelforeach techniqueis(8)withx(to)=(x-r)oThelinear optimal controller has a quadratic costofJ=2Thereachable set controller minimizes(9)J= J : 1)[(x-r)TQ(x-r)+u(r)Tu(r)]dr(10)And,ineithercase, the value for r(t)isthe position that will meet theboundaryconditionatthe final time withoutfurtherinput.Acceleration1000o-1000 -2000 -3000-4000ReachableSetControl -500000.51Time1 .52Figure 5.12 Acceleration vsTimeLQGand Reachable Set Control
PAGE 68
Velocity2000o-2000 -400053.......................................................-__-_. "'-ReachableSetControlo0.51 Time1 .5 2".'.ReachableSetControl Position140012001000800600400200o0Figure 5.13 Velocity vs Time LQG and Reachable Set Control"..............0.51TimeFigure 5.14 Position vs Time LQG and Reachable Set Control1 .5'"2
PAGE 69
54Summary The improved performanceofReachable Set Controlisobvious from Figure 5.14. While demonstrated for a specific plant, and symmetric control constraint set, Reachable Set Controliscapableofimproving the terminal performanceofa large classofsystems.Itminimized the effectsofmodeling errors (or target set uncertainty) by regulating the system to the zero control state. The technique handled constraints and insured an initially constrained trajectory. The tracking problem could be optimized to the response timeofthe system under considerationbysmoothly driving the system from some large displacementtoa region where the relatively high gain LQG controller remained linear.
PAGE 70
CHAPTER VI REACHABLE SET CONTROL FOR PREFERRED AXIS HOMING MISSILES As statedinChapter II, the most promising techniques that can extend the inertial point mass formulation are based on singular perturbations [37,38,39]. When applied to the preferred axis missile, eachofthese techniques leads to a controller that is optimal in some sense. However, a discussionof"optimality" notwithstanding, the best homing missile intercept trajectory is the one that arrives at the final "control point" with the highest probabilityofhitting the target. This probability can be broken down into autonomous and forced events.Ifnothing is changed, what is the probabilityofa hitorwhat is the miss distance?Ifthe target does not maneuver, can additional control inputs result in a hit? And, in the worse case,ifthe target maneuvers (or an estimation error iscorrected) will the missile have adequate maneuverability to correct the trajectory? Noneofthe nonlinear techniques based on singular perturbations attempt to control uncertaintyoraddress the terminally constrained trajectories caused by increasing acceleration profile. Unfortunately, an increasing acceleration profile has been observed in allofthe preferred axis homing missile controllers. In many cases, the genericbank-to-turnmissileof[11,18] was on all three constraints (Ny,Nz,P) during the latter portionofthe trajectory.Ifthe evading targetisable toputthe missileinthis position without approaching it's own maneuver limits,itwill not be possible for the missile to counter the final evasive maneuver. The missileisno longer controllable to the target set. The "standard" solution to the increasing acceleration profileisa varyingcontrol cost.55However, without additional
PAGE 71
56 additional modifications, this typeofsolution results in a trajectorydependentcontrol. As we have seen, Reachable Set Control is anLQG controlimplementationthatmoves the system to the point wherefurtherinputs are not required. A Reachable Set Controller that willrejecttarget and system disturbances,cansatisfy both the mathematicalandheuristic optimality requirementsbyminimizing the cost yet maintaining a controllable system. Since the roll control hasdifferentcharacteristics, the discussionofthepreferredaxis homing missile controller using the Reachable Set Control technique will be separated into translationalandroll subsystems.Thetranslational subsystem has a suitable null control trajectorydefinedbythe initial velocity and uncontrollable acceleration provided by the rocket motor.Theroll subsystem, however, is significantlydifferent.Inorderforthepreferredaxis missile to function, thepreferredaxis must be properly aligned. Consequently, both roll angleerrorandroll rate should be zero at all times. In this case, the null control trajectory collapses to the origin. Acceleration Control System Model Since we want to control the relative target-missile inertial system to the zero state, the controller will bedefinedinthis reference frame. Eachofthe individual system states aredefined(in relative coordinates) as target state minus missile state. Begin with the deterministic system: x(t)=Fx(t)+Gu(t)(I)
PAGE 72
57where andxyzx=Vx Vy Vzu= Since the autopilot modelisa linear approximation and the inertial model assumes instantaneous response, modeling errors will randomly affect the trajectory. Atmospheric and other external influences will disturb the system. Also, the determinationofthe state will requirethe useofnoisy measurements. Consequently, the missile intercept problem should be approached via a stochastic optimal control law. Because the Reachable Set Control technique will minimize the effectofplant parameter variations (modeling errors) and un modeled target maneuverstomaintain controllability, we can use an LQG controller. Assuming Certainty Equivalence, this controller consistsofan optimal linear (Kalman) filter cascaded with the optimal feedback gain matrixofthe corresponding deterministic optimal control problem. Disturbances and modeling errors can be accounted for by suitably extending the system description[40]:x(t)=F(t)x(t)+G(t)u(t)+Vs(t) by adding a noise process V s (") to the dynamics equations with(2) (3)
PAGE 73
58Therefore, let the continuous time state description be formally given by the linear stochastic differential equationx(t)= ct>( t, to)x( to) dx(t)=F(t)x(t)dt+G(t)u(t)dt+L(t)df3(t) (with13(,.)a Wiener process) that has the solution: ct>( t,r)G(r)u(r)dr+
PAGE 74
state to converge with zero error.59Since the optimal solution to the linear stochasticdifferentialequation is a Gauss-Markov process, time correlated processescanbe includedbyaugmenting the system state to include the disturbance process.Letthe time-correlated target (position) disturbance be modeled by the following: T(t)=N(t)T(t)+Wt(t)(8)with[Tx(t)IT(t)=Ty(t) Tz(t) and =Wt(t) While the target disturbance resulting from anunknownacceleration is localized to a single plane with respect to the body axisofthe target, the target orientation is unknown to the inertial model. Consequently, following the methodologyofthe Singer Model, each axis will be treated equally [41]. Since the disturbance is firstorderMarkov, it's components will be:andN(t)=-(I/Tc)[I] Wt(t)=(20't2 / T c)[I](9) (10)where Tcisthe correlation time, and O'r isthe RMS valueofthe disturbance process.ThePower Spectral Densityofthe disturbanceis: Wtt(W) =20't2 / T cw2+O/Tc)2(11) Figure 6.1 summarizes the noise interactions with the system.
PAGE 75
60WhiteGaussianNoiseJ-------'AtmosphericdisturbancesActuatorErrorsAutopilotErrors"DeterministicControls LinearSystem OutputsPhysicalModelSystemResponse Jl TargetAccelerationsShapingFilterWhite Gaussian NoiseFigure6.1Reachable Set Control Disturbance processes.Withappropriate dimensions, the nine state (linear) augmented system model becomes:[ ]T(t)=Reference Model Reachable Set Control requires a supervisory steering control (reference) that includes the environmental impact on the controlled dynamic system. Recalling the characteristicsofthe dual system, one was developed that explicitlyran(I) backward in time after determining the terminal conditions. However, in developing this control for this preferred axis missile a numberoffactors actually simplify the computationofthe reference trajectory:
PAGE 76
61(I)Thecontrol constraint setforthispreferredaxis missile is symmetric.Consequently.thereferencetrajectoryforaninterceptcondition.is a null control (coasting)trajectory.(2)Thebodyaxis X acceleration isprovidedbythemissile motor,andisnotcontrollablebutknown. This uncontrollableaccelerationwillcontributetothetotal inertial acceleration vector.mustbeconsideredbythe controller.andistheonlyacceleration present onanintercept(coasting)trajectory.(3)Theterminationoftheinterceptistheclosestapproach.whichnowbecomes thefixed-final-time(tf).Thetime-to-go(tgo) isdefinedwith respect tothecurrenttime (t) by: t=tftgo(13)(4)Thefinalboundaryconditionforthesystem state (target minus missile) is zero.Insummary,theinterceptpositions are zero.theinitial velocity is given.andthe average acceleration is a constant.Therefore.itissufficientto reversethedirectionoftheinitial velocityandaverageaccelerationthenrunthesystemforwardintimefortgo secondsfromtheorigintodeterminethecurrentpositionofthecoasting trajectory. Let:withandr(t)=A(t)r(t)+B(t)a(t)r(O)=rO=0A(t)=F(t)andB(t)=G(t)(14)thenr(t) isthepointfromwhich theautonomoussystem dynamics will takethesystem todesiredboundarycondition. Becauseofthe disturbances. targetmotion,andmodelingerrors,futurecontrolinputsarerandomvectors.Therefore,thebest policy isnotto
PAGE 77
62determine the input over the control period [to,tfl a priori buttoreconsider the situation at each instant t on the basisofall available information. At each update,ifthe systemiscontrollable, the reference (and system state) will approach zeroastgo approaches zero. Since the objectiveofthe controlleristo drive the system state to zero,wedo not require a tracker that will maintain the control variable at a desired non-zero value with zero steady state error in the presenceofunmodeled constant disturbances. There are disturbances, but the final set pointiszero, and therefore, aPIcontrollerisnot required. Roll Control Definition The roll modeismost significant sourceofmodeling errors in the preferred axis homing missile. While non-linear and high order dynamics associated with the equationsofmotion, autopilot, and control actuators are neglected, the double integratorisan exact modelfordetermining inertial position from inertial accelerations. The linear system, however,isreferenced with respect to the body axis. Consequently, to analyze the complete dynamics, the angular relationship between the body axis and inertial references must be considered. Recall Friedland's linearized (simplified) equations. The angular relationships determine the orientationofthe body axis reference and the roll rate appeared in the dynamicsofall angular relationships. Yet,tosolve the system using linear techniques, the system must be uncoupled via a steady state (Adiabatic) assumption. Also, the roll angleisinertially defined and the effectofthe linear accelerations on the erroristotally neglected.
PAGE 78
63Froma geometric pointofview, this mode controls the rangeofthe orthogonal linear acceleration commandsandthe constrained controllabilityofthe trajectory. With a20:I ratio in thepitchandyaw accelerations, the ability topointthepreferredaxis in the "proper" direction is absolutely critical. Consequently, effective roll control is essential to the performanceofthepreferredaxis homing missile.Thefirst problem in defining the roll controller, is the determinationofthe "proper" direction. There are two choices.Thepreferredaxis could be aligned with the target positionorthedirectionofthe commanded acceleration.Thefirstselectionisthe easiest to implement.Theseeker gimbal angles provide adirectmeasureofintercept geometry (Figure 6.2), and the roll angleerrorisdefineddirectly:(15)TargetLOS eyzxFigure 6.2. Roll AngleErrorDefinitionfromSeeker Angles. This selection, however,isnot the most robust. Depending on the initial geometry, the intercept point may not beinthe planedefinedbythecurrent
PAGE 79
64lineofsight (LOS) and longitudinal axisofthe missile. In this case, the missile must continually adjust its orientation (roll) to maintain the target in the preferred plane.Asrange decreases the angular rates increase, with the very real possibilityofsaturation and poor terminal performance. Consequently, the second definitionofroll angle error should be used. Considering the dynamicsofthe intercept, however, aligning the preferred axis with the commanded acceleration vectorisnotasstraightforwardasit seems. Defining the roll angle erroras 0e =Tan-I{A y /A z } (16) leadstosignificant difficulties. From the previous discussion, itisobvious that roll angle errors must be minimizedsothat the preferred axis acceleration canbeusedtocontrol the intercept. The roll controller must have a high gain. Assume, for example, the missileison the intercept trajectory. Therefore, both A y and A z will be zero. Now,ifthe target moves slightly in theYbdirection and the missile maneuvers to correct the deviation, the roll angle error instan taneously becomes 90 degrees. High gain roll control inputs to correct this situation are counter productive. The small A y may be adequate to completely correct the situation before the roll mode can respond. Now, the combinationoflinear and roll control leads to instabilityasthe unnecessary roll rate generates errors in future linear accelerations. The problems resulting from the definitionofequation16can be overcomebere-examining the roll angle error. First, A y and A z combine to generate a resultant vector at an angle from the preferred axis. In the processofapplying constraints, the acceleration angle that results from the linear accelerations canbeincreased or decreased by the presenceofthe constraint.Ifthe angleisdecreased, the additional rollisneeded to line up the preferred axis and the
PAGE 80
65desired acceleration vector.Ifthe constrained (actual) acceleration angleisincreased beyond the (unconstrained) desired value by the unsymmetric actionofthe constraints, then the roll controller must allow for this "over control" caused by the constraints. Define the roll angle errorasthe difference between the actual and desired acceleration vectors after the control constraints are considered. This definition allows for the full skid toturncapabilityofthe missile in accelerat ing toward the intercept point and limits rolling to correct large deviations in acceleration angle from the preferred axis that are generated by small accelera tions. ReferringtoFigure 6.3, three zones can be associated with the following definitions: "ec =Tan-I(Ay/Az } 'lea =Tan-I(Ny/Nz } "er = "ec 'lea;ZoneIZoneIIZoneIIIFigure 6.3. Roll Control Zones.(17) (18) (19)
PAGE 81
66Here Ny and N z are the constrained acceleration values. In Zone I, "er =O.The linear acceleration can complete the intercept without further roll angle change. Thisisthe desired locus for the roll controller. Both A y and A z are limited in Zone II. Thisisthe typical situation for the initial positionofa demanding intercept. The objectiveofthe roll controlleristokeep the intercept acceleration outofZone III where only A yislimited. In this case, the A y accelerationisinsufficient to complete the intercept yet significant roll angle change may be requiredtomake the trajectory controllable. Controller A dual mode roll controllerwasdeveloped to accommodate the rangeofsituations and minimize roll angle error. Zone I requires a lower gain controller that will stabilize the roll rate and maintain "er small. Zones II and III require high gain controllers. To keep Zone II trajectories from entering Zone III, the "ec will be controlled to zero rather than the roll angle error. Since the linear control valueisalso a functionofthe roll angle error, roll angle errors are determined by comparing the desired and actual anglesofa fixed high gain reachable set controller.Ifthe actual linear commands are used and a linear accelerationissmall becauseoflarge roll angle errors, the actual amountofroll needed to line up the preferred axis and the intercept point, beyond the capabilityofthe linear accelerations, will not be available because they have been limited by the existing roll angle error that must be corrected. Unlike the inertial motions, the linear model for the roll controller accounts for the (roll) damping and recognizesthatthe inputisa roll rate change command: = -w" +wP(20)
PAGE 82
67Therefore, the roll mode elements (that will be incorporated into the model are) are: (21) Also, the dual mode controller will require an output function and weighting matrix that includes both roll angle and roll rate. Kalman Filter The augmented system model (13)isnot block diagonal. Consequently, the augmented system filter will not decouple into two independent system and reference filters. Rather, a single, higher order filter was required to generate the state and disturbance estimates. A target model (the Singer model)wasselected and modified to trackma-neuvering targets from a Bank-To-Turnmissile [41,42]. Using this model, a continuous-discrete Extended Kalman Filter was developed. The filter used a9 state target model for the relative motion (target missile): (22) with u(t) the known missile acceleration, N the correlation coefficient, and Wt(t) an assumed Gaussian white noise input with zero mean. Azimuth, elevation, range, and range rate measurements were available from passive JR, semi-active, analog radar, and digitally processed radar sensors. The four measurements are seeker azimuth (,p), seeker elevation(0),range (r), and range rate (dr/dt):
PAGE 83
68 6 =-Tan-l{z(x2 +y2)-1/2} t/J =+Tan-l{y/x}= 1r +Tan-l{y/x}r={x2+y2+z 2}-1/2 r= x 0x 0(23)Noise statistics for the measurements are a functionofrange, and are designed to simulate glintandscintillation in a relatively inexpensive missile seeker. In contrast to the linear optimal filter, theorderofthe measurements for the extended filterisimportant. In this simulation, the elevation angle (6) was processed first, followed by azimuth (t/J), range (r), and range rate(dr/dt).In addition, optimal estimates were available from the fusionofthe detailed (digital) radar model and IR seeker. Reachable Set Controller StructureTheTarget-Missile System is showninFigure 6.4. The combinationofthe augmented system state and the dual referencethatgenerates the minimum control trajectory for the reachable set concept is best describedasa CommandGenerator/Trackerandisshown in Figure 6.5.Ina single systemofequations the controller models the system response, including time correlated position disturbances, and provides the reference trajectory. Since only noise-corrupted measurementsofthe controlled system are available, optimal estimatesofthe actual states were used. Becauseofthe processing time requiredforthe filter and delays in the autopilot response, a continuous-discrete Extended Kalman Filter,anda sampled
PAGE 84
69data(discrete) controller was used. This controller incorporated discrete cross-coupling terms to control the deviations between the sampling timesaswell the capability to handle non-coincident sampleandcontrol intervals (Appendices B andC).Combining the linear and roll subsystems with a firstorderroll mode for the roll angle state, the model for the preferred axis homing missile becomes: the reference: r(t)=A(t)r(t)+B(t)a(t) with the tracking error: e(t)=[yx(t) Yr(t) ]=[H(t)I0I-C(t)] r(t)(25)(26)The initial stateismodeled asann-dimensional Gaussian random variable with meanxOand covariancePO.E{ws(t)ws(t)T}=Ws(t) is the strengthofthe system (white noise) disturbances to be rejected,andE{wt(t)Wt(t)T}=Wt(t)isaninput to a stationary firstorderGauss-Markov processthatmodels target acceleration. The positions are the primary variablesofinterest,andthe output matrices will select these terms. Along with the roll rate, these are the variables that will be penalized by the control costandthe states where disturbances will directly impact the performanceofthe system.
PAGE 85
In block form, with appropriate dimensions, the system matrices are:F(t)=A(t)=F=[0I]OOwG(t)=B(t)=G=[0]Iw(28)H(t)=C(t)=[Ihw ]N(t)=-(ljTc)[I]L(t)=IM(t)=Iwhere the Ow, I w and h w terms are required to specify the roll axis system and control terms:
PAGE 86
71{-wi=j=8{1i=j=8 (Ow>ij = (hw>ij =0otherwise0otherwise{+wi=8,j=4 (Iw)ij = I otherwise The performance objective for the LQG synthesisisto minimize an appropriate continuous-time quadratic cost: Js(t) = E{Jd(t)II(t)}(29)where J sisthe stochastic cost, I(t)isthe information set available at time t, and Jd a deterministic cost function:(30)Dividing the intervalofinterest into N+I intervals for discrete time control, and summing the integral cost generates the following (see AppendixC):(31)which can be related to the augmented state X=[xTr ]T by:][ X(tj) u(ti)](32)
PAGE 87
72In general, with the cost terms defined for the augmented state (AppendixC),the optimal (discrete) solution to the LQG tracker can be expressedas:where*=-[G(ti)] [ x(ti) T(ti) r(ti)](33)and G*(ti)=[R(ti) + GT(ti)P(ti+l)G(ti) ] -1 [ + ST(tj) ] P(ti)=Q(ti) + -[+ ST(ti) ]TG*(ti)(34) (35)Since only the positions (and roll rate) are penalized, the Riccati recursionisquite sparse. Consequently, by partitioning the gain and Riccati equations, and explicitly carrying out the matrix operations, considerable computationalim-provements are possible over the straightforward implementationofa19by19tracker (Appendix D). Application The tracking error and control costs were determined from the steady state tracker used in the example in Chapter5.First, missile seeker and aerodynamic limitations were analyzed to determine the most demanding intercept attainable by the simulated hardware. Then, autopilot delays were incorporated to estimate that amountoftime that a saturated control would require to turn the missile after correcting a 90 degree (limit case) roll angle error. The steady state regulator was used to interactively place the closed loop polesandselect a control cost combination that generated non zero control for the desired lengthoftime. These same values were used in the time varying Reachable Set Controller with the full up autopilot simulation to determine the
PAGE 88
73terminal error cost and control delay time. To maintain a basisofcomparison, the Kalman Filter parameters were not modified for this controller. Appendix E contains initial conditions for the controller and estimator dynamics. During the initialization sequence (safety delay) for a given run, time varying fixed-final-time LQG regulator gains are calculated (via 36) based on the initial estimateofthe time togo.Both high and low roll control gains were computed. These solutions used the complete Riccati recursion and cost based on the sampled data system, included a penalty on the final state (to control transient behaviorastgo approaches zero), and allowed for non-coincident sample and control. Given an estimated tgo, at each time t, the Command GeneratorITracker computed the reference position and required roll angle that leads to an intercept without additional control input. The highorlow roll control gain was selected based on the mode. Then the precomputed gains (that are a functionoftgo) are used with the state and correlated disturbance estimate from the filter, roll control zone, and the reference r to generate the control (whichisapplied only to the missile system). Becauseofsymmetry, the tracker gain for the state term equaled the reference gain,sothat, in effect, except for the correlated noise, the current difference between the state x and the reference r determined the control value. During the intercept, between sample times when the stateisextrapolated by the filter dynamics, tgo was calculated based on this new extrapolation and appropriate gains used. This technique demonstrated better performance than using a constant control value over the durationofthe sample interval and justified the computational penalty of the continuous discrete implementationofthe controller and filter.
PAGE 89
MeasurementNoise InterceptGeometry--Sensors--TargetControl--Disturbances TargetDynamics--OutputFunction+ l' Command -ZeroOrder GeneratorI .. Hold -DIATrackerStateEstimationSample&Hold AIDSensorsReachableSetControllerContinuous-DiscreteKalmanFilterMeasurementNoise Autopilot--Actuators Missile MissileDynamics-OutputFunctionDynamicDisturbancesFigure 6.4 Target Missile System
PAGE 90
r-Reference ModelReferenceVariableDynamicsa....Missile Model Yr(t)e(t) ,----,LJ-DynamicDisturbancesMeasurementNoise ", G1G3u(ti)I---{ "')-------1 G2Missile -TargetSystem X(t)Q!LJi Z(t)Yx(t)X(ti)TargetManeuverModelA(ti)ContinuousDiscreteKalmanFilter-Figure 6.5 Command Generator / Tracker
PAGE 91
CHAPTER VII RESULTS AND DISCUSSIONAsan additional reference, before comparing the resultsofReachable Set Control to the baseline control, consider anair-to-airmissile problem from [13]. In this example,the launch directionisalong the lineofsight, the missile velocityisconstant, and the autopilot response to commandsisinstantaneous. The controller has noisy measurementsoftarget angular location, a priori knowledgeofthe time to go, and stochastically models the target maneuver. Even with this relatively simple problem, the acceleration profile increases sharply near the final time. Unfortunately, this acceleration profileistypical, and has been observed in all previous optimal control laws. Reachable Set Control fixes this problem. c.9-...oa:)_u'"u u0oen--....t= .EenE0=: 300 200100o6Time to go (sec) O'--....L------l-.---l 12Figure7.1RMS Missile Acceleration76
PAGE 92
77Simulation The performanceofReachable Set Control was determined via a highfi-delity Bank-To-Turn simulation developed at the UniversityofFlorida and used for a numberofprevious evaluations. The simulationisbased on the coupled non-linear missile dynamicsofchapter II equations (1)to(8) andisa continuous-discrete system that has the capabilityofcomparing control laws and estimators at any sample time. In addition to the non-linear aerodynamic parameters, the simulation models the Rockwell Bank-To-Turn autopilot, sensor (seeker and accelerometer) dynamics, has a non-standard atmosphere, and mass modelofthe missile to calculate time-varying momentsofinertia and the missile specific acceleration from the time varying rocket motor. Figure 7.2 presents the engagement geometry and someofthe variables used to define the initial conditions.Missiley,ZTarget G ax D ..... U9 ;./\j! 9Figure 7.2 Engagement Geometry
PAGE 93
78The simulated targetisa three (3) dimensional, nine (9)"g"maneuvering target. Initially, the target trajectoryisa straight line. Once the range from the missile to targetisless than 6000 feet, the target initiates an instantaneous 9"g"evasive maneuver in a plane determined by the target roll angle, an input parameter.Ifthe launch rangeiswithin 6000 feet,the evasive maneuver begins immediately. Thereisa.4second "safety" delay between missile launch and autopilot control authority. Trajectory Parameters The performanceofthe control laws was measured with and without sensor noise using continuous and sampled data measurements. The integration step was .005 seconds and the measurement step for the Extended Kalman Filter was.05seconds. The trajectory presented for comparison has an initial offset angleof40 degrees (tPg) and180degree aspect (tPa), and a target rollof90 degrees away from the missile. This angleoffand target maneuverisoneofthe most demanding intercept for a preferred axis missile since it must roll through 90 degrees before the preferred axisisaligned with the target. Other intercepts were run with different conditions and target maneuvers to verify the robustnessofReachable Set Control and the miss distances were similar or less that this trajectory. Results Deterministic Results These results are the best comparisonofcontrol concepts since both Linear Optimal Control and Reachable Set Control are based on assumedCertainty Equivalence.
PAGE 94
79Representative deterministic results are presented in Table7.1and Figure 7.3. Figures A.l throughA.9present relevant parameters for the 4000 foot deterministic trajectories. Table7.1Deterministic Control Law Performance Initial Control Time Miss Range Distance (feet) (sec) (feet) 5500 Baseline 2.34 8 Reachable 2.34 6 5000 Baseline2.2113Reachable2.21104800 Baseline 2.1715Reachable 2.17 4 4600 Baseline 2.13 29 Reachable 2.13 6 4400 Baseline 2.0638Reachable 2.08 7 4200 Baseline 2.0235Reachable 2.05 5 4000 Baseline1.9854Reachable 2.00133900 Baseline1.9843Reachable1.988 3800 Baseline1.9840 Reachable1.988 3700 Baseline 2.02 44 Reachable 1.99103600 Baseline 1.99136Reachable 1.9965
PAGE 95
80MissDistance(feet) BaselineGuidanceLaw ............./ SetControl140 f-120 100 80 f-60-40 20 e-j500 40004500Initial Range50005500Figure7.3Deterministic Results An analysisoftrajectory parameters revealed that oneofthe major performance limitations was the Rockwell autopilot. Designed for proportional navigation with noisy (analog) seeker angle rates, the self adaptive loops in the autopilot penalized a high gain control law suchasReachable Set Control. This penalty prevented Reachable Set Control from demonstrating quicker intercepts and periodic control that were seen with a perfect autopilot on a similar simulation used during the research. However, even with the autopilot penalty, Reachable Set Controlwasabletosignificantly improve missile performance near the inner launchboundary. This verifies the theoretical analysis, since thisisthe region where the target set errors, control constraints, and short run times affect the linear law most significantly.
PAGE 96
81Stochastic Results Stochastic performance was determined by 100 runsateach initial condition.Atthe terminationoftherun,the miss distanceandTimeofFlight (TOF) was recorded. During eachofthese runs, the estimatorandseeker (noise)errorsequences were tracked. Both sequences were analyzed to insure gaussian seeker noise, and an unbiased estimator (with respect to each axis).Fromthe final performance data, the meanandvarianceofthe miss distance was calculated. Also, from the estimator and seeker sequences, the root mean square (RMS)errorand varianceforeach run was determined toidentifysome general characteristicsofthe process. The averageofthese numbersispresented. Care must be taken in interpreting these numbers. Since the measurement errorisafunctionofthe trajectoryaswell as instantaneoustrajectoryparameters, a singlenumberisnot adequate to completely describe the total process. Table 7.2andFigure 7.4 present average results using the guidance laws with noisy measurementsandthe Kalman filter.MissDistance(feet)250 200150 100BaselineGuidanceLaw..,..' ., .....50 ReachableSetControl S500 40004500Initial Range50005500Figure 7.4 Stochastic Results
PAGE 97
82 Table 7.2 Stochastic Control Law Performance Initial Control Time Miss Distance RMS Error Range Mean VarianceEKFSeeker (feet) (sec) (feet) (feet)(deg) 5500 Baseline 2.38 273 11470111.3Reachable2.41832677111.55000 Baseline 2.23 240 7874111.4Reachable 2.25 89 3937101.64800 Baseline 2.181937540101.4Reachable 2.191143708101.64600 Baseline 2.13 172 5699101.4Reachable 2.131071632101.54400 Baseline 2.081294324101.5Reachable 2.07 851421101.64200 Baseline 2.041233375101.6Reachable 2.03 62 673101.74000 Baseline2.011052745101.7Reachable2.01661401101.83900 Baseline 2.001053637101.8Reachable 2.00794356101.83800 Baseline 2.001245252101.8Reachable 1.9910510217102.0 3700 Baseline1.981595240101.8Reachable1.9817613078 91.83600 Baseline1.95230 6182101.7Reachable1.95239 14808101.7
PAGE 98
83The first runs made with Reachable Set Control were notasgoodasthe results presented. Reachable Set Control was only slightly (10to20feet) superiortothe baseline guidance law and was well below expectations. Yet, the performanceofthe filter with respect to position error was reasonable, manyofthe individual runs had miss distances near20feet,andmostofthe errors were in the Z axis. Analyzing several trajectories from various initial conditions ledtotwo main conclusions. First, the initial and terminal seeker errors were quite large, especially comparedtothe constant 5 mrad tracking accuracy assumed by many studies [2,3,18]. Second, the non-linear coupled natureofthe preferred axis missile, combined with range dependent seeker errors, and the system (target) model, makes the terminal performance a strong functionofthe particular sequenceofseeker errors. For example, Figure 7.5 compares the actual and estimated Z axis velocity (Target Missile) from a single 4000 foot run. The very first elevation measurement generated a14foot Z axis position error. A reasonable number considering the range. The Z axis velocity error, however, was quite large, 409 feet per second, and never completely eliminated by the filter. Recalling that the target velocityis969 feet per second,isapproximately co-altitude with the missile and maneuvers primarily in the XY plane, this errorissignificant when compared to the actual Z axis velocity(2feet per second). Also, thisisthe axis that defines the rollangle error and, consequently, roll rateofthe missile. Errorsofthis magnitude cause the primary maneuver planeofthe missile to roll away from the target limiting (via the constraints) the abilityofthe missile to maneuver. Further investigation confirmed that the filter was working properly. Although the time varying noise prevents a direct comparison for an entire trajectory, these large velocity errors are consistent with the covariance ratios
PAGE 99
84 in [41]. The filter model was developed to track maneuvering targets. Thepen-alty for tracking maneuvering targetsisthe inability to precisely define allofthe trajectory parameters (ie. velocity). More accurate (certain) models track better,butrisk losing track (diverging) when the target maneuvers unexpectedly. The problem with the control then, was the excessive deviations in the velocity. To verify this, the simulation was modifiedtouse estimatesofposition,buttouseactual velocities. Figure 7.6 and Table7.3has these results.Asseen from the table, the control performanceisquite good considering the noise statistics and autopilot.ZAxis Velocity(feet/sec)600400200 Actual, 0-200 -400 -6000 0.5 11 .5 2TimeFigure7.5MeasuredvsActual Z Axis Velocity
PAGE 100
85 Table 7.5 Stochastic ControlLawPerformance Using Actual Velocities Initial Control Time Miss Distance RMSErrorRange Mean VarianceEKFSeeker (feet) (sec) (feet) (feet) (deg) 5500 Baseline 2.35 34 318101.5 Reachable 2.3738504II1.5 5000 Baseline 2.21 50 506101.7Reachable 2.2338623101.6 4800 Baseline 2.1755367101.6 Reachable 2.19 48 526101.54600 Baseline 2.1261366101.6 Reachable 2.1452326101.6 4400 Baseline 2.06 62 333101.7 Reachable 2.1045415101.7 4200 Baseline 2.02 60 277101.7 Reachable 2.06 44 463101.84000 Baseline 1.98 62 330101.9 Reachable 2.0253lOll9 1.9 3900 Baseline 1.9855436101.9 Reachable 2.00 64 2052101.9 3800 Baseline 1.98 53 400 9 2.0 Reachable 1.99912427101.8 3700 Baseline 1.99 63 235102.0 Reachable 1.991403110101.7 3600 Baseline 1.98138354101.8 Reachable 1.96 213 4700101.6
PAGE 101
86MissDistance(feet)25020015010050 ,. ReachableSetControlBaseline GUidance/ ................................................... -------_...------9500 40004500InitialRange5000 5500Figure 7.6 Performance Using Position Estimates and Actual Velocities While the long term solution to the problemisa better target model that will accommodate both tracking and control requirements, the same model was used in order to provide a better comparison with previous research. For the same reason, the Kalman filter was not tuned to function better with the higher gain reachable set controller. However, target velocity changes used for the generationofthe roll angle error were limited to the equivalentofa 20 degree per second target turn rate. This limited the performance on a single run, but precluded the 300 foot miss that followed a 20 foot hit.
PAGE 102
87Conclusions Reachable Set ControlAsseen from the Tables7.1and 7.2, Reachable Set Controlwasinherently more accurate than the baseline linear law, especiallyin the more realistic case where noiseisadded and sampled data measurements are used. In addition, Reachable set control did insure an initially constrained trajectory for controllable trajectories, and required minimal accelerations duringthe terminal phaseofthe intercept. Unlike previous constrained control schemes, Reachable Set Control was better able to accommodate tinmodeled non-linearities and provide adequate performance with a suboptimal sampled data controller. While demonstrated with a Preferred Axis Missile, Reachable Set Controlisa general technique that could be used on most trajectory control problems. Singer Model Unmodified, the Singer model provides an excellent basis for a maneuver ing target tracker, but itisnot a good model to use for linear control. Neither control law penalized velocity errors. In fact, the baseline control law did not define a velocity error. Yet the requirementsoflinearity, and the integration from velocity to position, require better velocities estimates than are provided by this model (for this quality seeker).
PAGE 103
APPENDIXASIMULAnONRESULTS
PAGE 104
YPosition(feet)40003000 20001000o0 13 footmiss 100020003000XPosition(feet)FigureA.lXYMissile&Target Positions Reachable Set Control4000YPosition(feet)40003000 200054 feet 1000Behind Targeto0100020003000XPosition(feet)Figure A.2XYMissile&Target Positions Baseline Control Law894000
PAGE 105
90
PAGE 106
91Acceleration(g)10050 -o-50 f-t >1009CommandedAcceleration-100o0.51Time1 .52FigureA.SMissile Acceleration Reachable Set ControlAcceleration(g)10050o-50 I>1009CommandedAcceleration-100o0.51Time1 .52Figure A.6 Missile Acceleration Baseline Control Law
PAGE 107
92Roll Rate (deg/sec)6004002000-200-400 -60000.51Time...1.52Figure A.7 Missile Roll Commands&Rate Reachable Set ControlActualRoIIRate(deg/sec)600400200o-200 -400Commanded '.:-600o0.51Time1 .52Figure A.8 Missile Roll Commands&Rate Baseline Control Law
PAGE 108
93Roll AngleError(deg)50o-50o0.51Time1 .52Figure A.9 Missile Roll Angle Error
PAGE 109
APPENDIX B SAMPLED-DA TA CONVERSION System Model The system model for the formulationofthe sampled-data control law consistsofthe plant dynamics with time correlated and white disturbances: [ ]=[F(t) ][X(t)]+[GO(t)] u(t)+T(t) 0 N(t) T(t) the reference: r(t)=A(t)r(t)+B(t)a(t) with the tracking error:[ L0][WS(t) ](1)OMWt(t)(2)e(t)=[yx(t) -Yr(t)]=[H(t)I0I-C(t) ] )(3)r(t)andquadratic cost Js(t)=E{Jd(t)II(t)}, where(4)E{ws(t)ws(t)T}=Ws(t)isthe strengthofthe system (white noise) disturbances to be rejected. E{wt(t)Wt(t)T}=Wt(t) is an input to a stationary first order Gauss-Markov process that models target acceleration (see CH VI). The following assumes a constant cycle time that defines the sampling interval and a possible delay between sampling and controlof =ti' ti94
PAGE 110
9SThecomponentsofthe controlled system arex(t)=x(t)y(t)Z(t)0(t)x(t)T(t)= [::::: ]Tz(t) r(t)(5)o0u(t)y(t) z(t)0(t)Nx(t)Ny(t)Nz(t)P(t)e(t)x(t)y(t) z(t)0(t)ry(t) rz(t)r0(t)rx(t) ry(t) rz(t)r0(t)o0o0In block form, withappropriatedimensions, the system matrices are F(t) = A(t) =F=[0I]OwG(t)=B(t)=G= (6)H(t)=C(t)=[Ihw ]i=j=8 otherwise{+wi=8,j=4(Iw)ij=I otherwise L(t)=I(hw)jj={ M(t)=Ii=j=8 otherwise
PAGE 111
96Sampled-dataEquationsSystemFromthecontinuoussystem,thediscrete-timesampled-datasystemmodelcanbesummarizedinthefollowingsetofequations:y(ti')=H x(ti)+D u(ti)ForatimeinvariantFwithaconstantsampleinterval:
(t .t)-eF(t..-1.)eF(.lt) with .lt -ttxHI'1-HI1-HI-1I000 .lt 00 0 0I00 0 .ltO 0 00I0 0 0 .ltO 0 00I0 0 0 x 48x(.lt) =0 000I000 0 0000I00 0 000 00I0 0 000 000 x 88 with x 48(.lt) = (l!w(.lt)})x 88(.lt) =exp{ -w>(.lt)} (7) (8)
PAGE 112
97Also, since G and H are time invariantandthe u(ti) are piecewise continuous, theinputandoutputmatrices (allowing fornon-coincidentsamplingandcontrol) arefi+lG= cP(ti+l,r)G(r)dr (9)t.1H=H cP(ti',ti) (10) r D=H (11)t.1Therefore, At 2j2 00 0 0 At2j2 0 0 00 At 2j2 0 0 00 At-cPx4S(At) G= At 0 0 0 (12) 0 At 0 0 0 0 At 0 0 0 0 l-cPxSS(At) I00 0 At' 000 0I00 0 At' 00H=00I00 0 At' 0(13)000I0 0o cPX4S(At') 00 000 000 0 0 0 00 000 0 0 0 00 000 00 0 00 0o cPXSS(At')At,2j2 00 0 0 At,2 j 2 0 0 00 At,2j2 0 0 00 At' -cPX4S(At') D= At' 00 0 (14) 0 At' 0 0 0 0 At' 0 0 0 0 l-cPxSS(At')
PAGE 113
98Target DisturbanceFortime correlated disturbances, Wt(t)=Wt=2 at2 ITc where at is the RMS valueofthe noise process T(,.), and T c is the correlation time.N(t)=-(1/Tc)[I]The sampled data disturbanceis(15) (16)Wt(ti)isa sequenceofzero mean mutually uncorrelated random variables, and is givenby:with =T c(l the sampled data impact on the system.(17)
PAGE 114
100 C=C
PAGE 115
APPENDIX C SAMPLED DATA COST FUNCTIONS Assume that the performance objectiveisto minimize an appropriate continuous-time quadratic cost.(1)where J sisthe stochastic cost, Jd a deterministic cost, and I(t)isthe information set available at time t. Let Jd(t)=erT Prer +rr(e(t)TQ(t)e(')+u(t)TR(t)u(tdT to (2)Dividing the intervalofinterest into N+l control intervals for discrete time control, Jd(t)=e(tn+l)TPfe(tn+l)N Jti+1 + [t{e(t)TQ(t)e(t) + u(t)TR(t)u(t)}dr]1whereforall tE [tj,ti+l)' u(t)=u(ti). Substituting the following for e(t):(3)e(t)= + t.1+Jt ti101(4)
PAGE 116
102thedeterministiccost becomes: (5)+N E [i=OJti+1 [ [4>( t,ti)e(ti)tI+{Jt 4>(t,r)G(r)dr}u(ti) t.1+Jt {4>(t,r)L(r)dr}w(ti)]T t.1+{It 4>(t,r)G(r)dr}u(ti) t.I+{It 4>(t,r)L(r)dr}w(ti)] t.1Makingthefollowingsubstitutions:(6){It 4>(t,r)G(r)dr}T t.1Q(t) {Jr 4>(r,ti)G(r)dr}]dt .t1(7)J'i+l r'l;(t,T)L(T)dT} TQ(t) Www(ti)=[{t.t.I 1.{r 'l;(t,T)L(T)dT}]dt t.1 f+l4>(t,ti) T 1: Wxu(ti)=[Q(t){ 4>(t,r)G(r)dr}]dt t.1 1(8) (9)
PAGE 117
103fi+1J: Wxw(ti)=[ t.1iWWX(ti)=[f+'( f T t.t.1 1WUW(ti)=[fi+1( f T Q(t)t.t.1 1( f t.1WWU(ti)=[fi+'( f T Q(t)t.t.1 1( f t.1And,allowingforthefactthatforall tE[ti,ti+l):u(t)=u(ti) e(t)=e(ti) w(t)=w(ti)thedeterministiccostcanbe expressed as:Jd=e(tN+ 1)1r Pfe(tN+l)+N E (e(ti) 1r Wxx(ti)e(ti) + u(ti) 1r Wuu(ti)u(ti)i=O+ W(ti)1rWWW(ti)W(ti) + 2e(ti)1rWXU(ti)U(ti) + 2e(ti)1rWXW(ti)W(ti) +2u(ti)1rWUW(ti)W(ti)}(10) (11) (12) (13) (14) (15) (16)
PAGE 118
104RecallthatJs(t)=E{Jd(t)II(t)}andthatw(.,.) is zeromeananduncorrelatedwitheithereoru. Sinceiftwo variables x&yareuncorrelatedthenE{x,y}=E{x}E{y}.Therefore,E{2e(ti)T Wxw (ti)W(ti)}=2E{e(ti)T w (ti)}E{w(ti)}=0 E{2u(ti)TWxw(ti)W(ti)}=2E{u(ti)T w (ti)}E{w(ti)}=0and(17)Jt t.1(18)Jti+1=tr[Q(t){t.1Jt s(r)L(r) T Tdr}dt)t.1(19)Consequently,forconsiderationinJ sJd=e(tN+l)TPf e(tN+l)N+ E (e(ti)TWxx(ti)e(ti) + U(ti)TWuu(ti)U(ti) +Jw(ti)1=0(20)Wxu(ti) Wuu(ti) (21)
PAGE 119
105Forsmall sample times, the weighting functionscanbeapproximatedby:WxxQ(t).M Wxu(I/2)Q(t)G(t)(.6.t)2 Wuu-[R(t)+( I13)G( t) T Q(t)G(t)(.6.t)2](.6.t) (22)(23)(24)Inorderto relate the valuesofthe sampleddataweighting terms on the systemerrorto the state variables (using discrete variable notation), let: P(ti)=[H 0 -C]Tpr[ H0-C](25)Q(ti)=[H 0-C]T Wxx(ti)[ H0-C](26)R(ti)=Wuu (27) S(ti)=[H 0-C]T Wxu (28)Therefore,withthe augmented state variableX=[xTr ]T:ExpandingQandST](29)[ -CTWXX(ti)Hoo o ] CTWXX(ti)C(30)
PAGE 120
106With non-coincident sampling and control, the penalty terms are modifiedasfollows: e(ti')=[H0-C] +rti' This modification adds additional terms:[DE][U]ati'(31 ) (32){[HOC] +[DE][U]}T.r ti' a ti'C] [DE][U]}r ti' a ti' AgainwithWxx(ti)=Wxx(ti') and Wxu(ti')=Wxu(ti), the additional terms can be grouped withRandS to generate:R(ti')=R(ti)+DTWxx(ti)D+DTWxu(ti) S(ti')=S(ti)+DTWxx(ti)[ H0-C](33) (34)
PAGE 121
APPENDIXDLQGCONTROLLERDECOMPOSITIONThecombinationofthesystem, targetdisturbance,andthereference, result in a19state controller.However,thedecoupledstructure,symmetry,andthe zerosinthe controlinputandcost matricescanbeexploitedto streamline the calculations. In general, theoptimalsolution to theLQGtrackercanbe expressed as: whereand G*(ti) =[R(ti) +GT(ti)P(ti+l)G(ti)rl [GT(ti)P(ti+l)cI>(ti+l,ti) + ST(ti)] P(ti)=Q(ti) + cI>T(ti+}.ti)P(ti+l)cI>(ti+l,ti) [GT(ti)P(ti+l)cI>(ti+}.ti) + ST(ti)]TG*(ti)(I)(2)(3)Toreduce thenumberofcalculations,partitionthe gainandRiccatiequationsuchthat: Evaluating terms:(4)GT (ti)P( ti+ 1 )G(til=[ =[=[GT10I0][PIIPl2P13 ][G]P21P22 P23 0P31P32 P33 0GTPIIIGTPI2IGTP13J [g] GTPIIG]107(5)
PAGE 122
108Therefore,Now consider, GT (ti)P(ti+ 1 I ,ti) 0=[GT1010][PIIP12P13j 00P21P22 P230 0P31P32 P3300 0=[GTPIIIGTPl21GTP13 ] 00 0 0 00 (6) (7)[ I + I oFromAppendix C, S=[H0-C]TWxu Substituting, the required terms for the gain computation become:,ti) + ST(ti))= + WxuTHI + o(8)Consequently, with(9)the optimal control can be expressedas:G1(ti)=GI + WxuTH] G2(tj)=GI + 0G3(ti)=GI WxuTC](10)(11)(12)
PAGE 123
109Partitioningequation3:[PHPI2Pl3] [ 0-HTWXX('i)C](13) P21 P22 P23=0 (ti) P31 P32 P33(ti)-CTWXX(ti)H 0 4>n 0 T 4>n 0 4>x [ PI IP12Pl31 4>x 0 0 0 0 + 0 4>T 0 P21 P22P23 0 4>T 0 0 0 4>r P31 P32P33 (ti+ 1) 0 0 4>r4>n 0 -([ GTI0101[PIIPI2Pl3] 4>x 0 0P2IP22 P23 0 4>T 0P3IP32 P33ti+I)0 0 4>r +[WxuTHI01-WxuTC]}T [GIlG21G3]=[ -CTWxXCti)Ho oo ] CTWXX(ti)C (14)+ {4>xTPll}{[4>n TOJPll+ 4>xTP2t>{4>rTP3t>{4>x TPI2} {4>x TPI3} {[4>n T OJPI2+4>TTP22}{[4>n T 0]P13+4> T T P23 }{4>rTP32}{4>rTP33}[;X4>n )('i+l)0 4>T 0 GTPI2I GTPl31 [;x4>nil -{[GTPIII0 4>T 0 + [WxuTHI0I-WxuTCnT [GIIG21G3]
PAGE 124
= -cT Wxx(ti)H 110 g oCTWXX(ti)C (15) o + o o TH}T [GIIG21G3]o (ti+l)ForthepropagationoftheRiccatiequationonly these termsarerequiredtogeneratethecontrol gain:Pll(ti)=HTWxx(tj)H + TH}T][GIlPI2(ti)= + o-[[G2] P13(ti)=-HTWxx(ti)C + -(16)(17)(18)
PAGE 125
APPENDIX E CONTROLLER ANDFILTERPARAMETERS Controller Control Delay=.21seconds Target Maneuver Correlation time=.21seconds Riccati initialization (Pf) Linear controller::Pf=IE+2 for sample times<.01secondPf=lEIfor sampletimes>.01second Roll rate controller:Pf=1.0Quadratic Cost Terms (Continuous) Linear Accelerations Q=320.R=l.Roll Control (Angle) (Rate)QIlQ22RHI GAIN IE+lIE-4lE-3LOGAINlE-4lE-3Sample Time=integration (not sample) stepforthe continuous-discrete filter System Disturbance Input=(AT AM) (Tf-TO)2/ 2.III
PAGE 126
112Filter Target Correlation Time = 2.0 seconds Riccati initialization (Pf)Positions:PII= 2500 Velocities: P44 = 2.0E6 Covariance:Pl4= 5.0E4 Maneuver Excitation Matrix (2a2) = 5120000 Seeker Measurement Noise Azimuth&Elevation R= (SITH*SITH/(RNG HAT*RNGHA T)+SOTH*SOTH +SITHI*SITHI*RNG HAT**4)/MEAS ST Range R= (SOR*SOR+SIR*SIR*RNG HAT**4)/MEAS ST Range Rate R= (SODR*SODR+SIDR*SIDR*(RNG HAT**4) )/MEAS ST With SITH = SIPH =1.5SOTH = SOPH = .225E-4 SITHI= SIPHI= 0.0 SOR = SODR = 3.0 SIR =IE-8SIDR = .2E-IO
PAGE 127
LISTOFREFERENCES [1] A. Arrow, "StatusandConcerns for Bank-To-TurnControlofTactical Missiles," GACIACPR-85-01,"Proceedingsofthe Workshop onBank-To-TurnControlled Terminal Homing Missiles," Vol1,Joint ServiceGuidanceandControl Committee, January 1985. [2] J.R. McClendon and P.L. Verges, "ApplicationsofModern Control and Estimation Theory to the GuidanceandControlofTacticalAir-to-AirMissiles," Technical ReportRG-81-20,"Research onFurureArmyModular Missile," USArmyMissile Command, Redstone Arsenal, Alabama, March 1981. [3] N.K. Gupta, J.W. Fuller,andT.L. Riggs, "Modern ControlTheoryMethodsforAdvanced Missile Guidance," Technical ReportRG-81-20,"Research onFurureArmy Modular Missile," USArmyMissile Command, Redstone Arsenal, Alabama, March 1981. [4] N.B. Nedeljkovic, "New Algorithms for Unconstrained Nonlinear Optimal Control Problems," IEEE Transactions on Automatic Control, Vol.AC-26,No.4,pp868-884, August 1981. [5] W.T. Baumann and W.J.Rugh,"Feedback ControlofNonlinear Systems byExtendedLinearization," IEEE Transactions on Automatic Control, Vol.AC-31,No.1,pp. 40-46, January 1986. [6] E.D. Sontag, "Controllability and Linearized Regulation,"DepartmentofMathematics, Rutgers University, New Brunswick, NJ (unpublished),14February 1097.[7]A.E. Bryson and Y.C. Ho, Applied Optimal Control, Blaisdell Publishing Company, Waltham, Massachusetts, 1969. [8] Y.-S. Lim, "LinearizationandOptimizationofStochastic Systems with Bounded Control," IEEE Transactions on Automatic Control, Vol.AC-15,No.1,pp49-52, February 1970. [9] P.-O.GutmanandP.Hagander,"ANew DesignofConstrained ControllersforLinearSystems," IEEE Transactions on Automatic Control, Vol.AC-30,No. 1,pp22-33, January 1985. [10] R.L. Kousut, "Suboptimal ControlofLinearTime-InvariantSystems Subject to Control Structure Constraints," IEEE TransactionsonAutomatic Control, Vol.AC-15,No.5,pp 557-562, October 1970.113
PAGE 128
114[11] D.J. Caughlin, "Bank-To-TurnControl," Master's Thesis, UniversityofFlorida, 1983. [12] J.F. Frankena and R. Sivan,"Anon-linear optimal control law for linear systems," INT. J. CONTROL, Vol 30, NoI,pp159-178, 1979. [13] P.S. Maybeck, Stochastic Models. Estimation. and Control, Volume 3, Academic Press, New York, 1982. [14] S.A. Murtaugh and H.E. Criel, "FundamentalsofProportional Navigation," IEEESpectrum, pp.75-85, December 1966. [15] L.A. Stockum, and I.C. Weimer, "Optimal and Suboptimal Guidance for a Short Range Homing Missile," IEEE Trans. on Aerospace and Electronic Systems, Vol. AES-12,No.3,pp 355-361, May 1976. [16]B.Stridhar, and N.K. Gupta, "Missile Guidance Laws Based on Singular Perturbation Methodology," AIAA JournalofGuidance and Control, Vol 3,No.2,1980. [17] R.K. Aggarwal and C.R. Moore, "Near-Optimal Guidance LawforaBank-To-TurnMissile," Proceedings 1984 American Control Conference, Volume 3, pp. 1408-1415, June 1984. [18] P.H. Fiske, "Advanced Digital Guidance and Control Concepts forAir-To-AirTactical Missiles,"AFATL-TR-77-130,Air Force Armament Laboratory, United States Air Force, Eglin Air Force Base, Florida, January 1980. [19] USAF Test Pilot School, "Stability and Control Flight Test Theory,"AFFTC-77-I,revised February 1977. [20] L.C. Kramer andM.Athans, "On the ApplicationofDeterministic Optimization MethodstoStochastic Control Problems," IEEE Transactions on Automatic Control, Vol. AC-19,No.I,pp22-30, February 1974. [21]Y.Bar-Shalom and E. Tse, "DualEffect,Certainty Equivalence, and Separation in Stochastic Control," IEEE Transactions on Automatic Control, Vol. AC-19,No.5,pp 494-500, October 1974. [22] H. Van DE Water and J.C. Willems, "The Certainty Equivalence Property in Stochastic Control Theory," IEEE Transactions on Automatic Control, Vol. AC-26,No.5,pp1080-1086, October 1981. [23] "Bank-To-TurnConfiguration Aerodynamic Analysis Report" Rockwell International Report No. C77-142Ij034C, date unknown.
PAGE 129
115[24] D.E Williams andB.Friedland, "DesignofAn AutopilotforBank-To-TurnHoming Missile Using Modern Control and Estimation Theory," ProcFifthMeetingofthe Coordinating Group on Modern Control Theory, 15-27 October 1983, (Picatinny) Dover, New Jersey, October 1983, pp. 397-419. [25]B.Friedland,etaI., "On the "Adiabatic ApproximationforDesignofControl Laws for Linear, Time-Varying Systems,"IEEETransactions on Automatic Control, Vol.AC-32,No.1,pp. 62-63, January 1987. [26] D.W. Tufts and D.A. Shnidman, "Optimum Waveforms Subject to Both Energy and Peak-Value Constraints," Proceedingsofthe IEEE, September 1964. [27] M. Pontier and J. Szpirglas, "Linear Stochastic Control with Constraints, IEEE Transactions on Automatic Control, Vol.AC-29,No. 12, pp 1100-1103, December 1984. [28] P.-O.GutmanandS.Gutman,"ANote on the ControlofUncertain Linear Dynamical Systems with Constrained Control Input," IEEE Transactions on Automatic Control, Vol. AC-30,No.5,pp 484-486, May 1985. [29]M.W.Spong, J.S. Thorp, and J.M. Kleinwaks, "The ControlofRobot Manipulators withBounded Input," IEEE TransactionsonAutomatic Control, Vol. AC-31,No.6,pp 483-489, June 1986. [30]D.Feng and B.H. Krogh, "Acceleration-Constrained Time Optimal Control in n Dimensions," IEEE Transactions on Automatic Control, Vol. AC-31, No. 10, pp 955-958, October 1986. [31] B.R. BarmishandW.E. Schmitendorf, "New Results on ControllabilityofSystemsofthe Form x(t)=A(t)x(t)+F(t,u(t,"IEEETransactions on Automatic Control, Vol. AC-25,No.3,pp 540-547, June 1980. [32] W.-G. HwangandW.E. Schmitendorf, "Controllability Results for Systems with a Nonconvex Target," IEEE TransactionsonAutomatic Control, Vol. AC-29,No.9,pp 794-802, September 1984. [33] T. Kaliath, Linear Systems, Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1980. [34] M. Vidyasager, Nonlinear Systems Analysis, Prentice-Hall, Englewood Cliffs, New Jersey, 1978. [35] K. Zhou, and P. Khargonekar, "Stability Robustness Bounds for Linear State-Space Models with Structured Uncertainity," Transactions on Automatic Control, Vol.AC-32,No.7,pp 621-623, July 1987. [36] K.G. Shin and N.D. McKay, "Minimum Time ControlofRobotic Manipulators with Geometric Path Constraints," IEEE TransactionsonAutomatic Control, Vol. AC-30,No.6,pp 531-541, June 1985.
PAGE 130
116[37] A. Sabari and H. Khalil. "Stabilization and RegulationofNonlinear Singularly Perturbed Systems Composite Control," IEEE Transactions on Automatic Control, Vol.AC-30,No.8,pp. 739-747, August 1985. [38] M. Sampei and K.Furuta,"On Time Scaling for Nonlinear Systems: Application to Linearization." IEEE Transactions on Automatic Control, Vol.AC-31.No.5,pp 459-462, May 1986. [39] I.J. Ha and E.G. Gilbert."AComplete CharacterizationofDecoupling Control Laws for a General ClassofNonlinear Systems," IEEE Transactions on Automatic Control, Vol.AC-31,No.9,pp. 823-830. September 1986. [40] H. K wakernaak and R. Sivan, Linear Optimal Control Systems, Wiley-Interscience. New York, 1972. [41] R.A. Singer. "Estimating Optimal Tracking Filter Performance for Manned Maneuvering Targets," IEEE Transactions on Aerospace and Electronic Systems, Vol. AES6,No.4,pp 473-483, July 1970. [42] M.E Warren and T.E. Bullock, "DevelopmentandComparisonofOptimal Filter Techniques with Application toAir-to-AirMissiles," Electrical Engineering Department. UniversityofFlorida, Prepared for the Air Force Armament Laboratory, Eglin Air Force Base, Florida, March 1980.
PAGE 131
BIOGRAPHICAL SKETCH Donald J. Caughlin, Jr., was born in San Pedro, California on17Dec. 1946. He graduated from the United States Air Force Academy, earning aB.S.in physics, and chose pilot training insteadofan Atomic Energy Commission Fellowship. Since then he has flown over 3100 hours in over 60differentaircraft and completed one tour in Southeast Asia flying theA-ISkyraider. A Distinguished Graduateofthe United StatesAirForce Test Pilot School, Don has spent muchofhis career in research, development,andtest at both major test facilities--Eglin AFB in Florida, and Edwards AFBinCalifornia. Don Caughlinisa Lieutenant Colonel in theUnitedStatesAirForcecurrently assigned as the Assistant for SeniorOfficerManagement at HeadquartersAirForce Systems Command. In addition to theB.S.in physics,Lt.Colonel Caughlin has an M.B.A. from the UniversityofUtah, and a masters degreeinelectrical engineering from the UniversityofFlorida. Heisa memberofthe SocietyofExperimental Test Pilots and IEEE. Lt. Colonel Caughlinismarried to theformerBarbara SchultzofMontgomery, Alabama. They have two children, a daughter, Amy Marie, age eight, and Jon Andrew, age four.117
PAGE 132
AUTHOR: TITLE:Internet Distribution Consent AgreementIn reference to the following dissertation: Caughlin, Donald Reachable Set Control for Preferred Axis Homing Missles (record number: 1102747) PUBLICATION DATE: 1988I, fiJoJ as copyright holder for theaforementioned dissertation, hereby grant specific and limited archive and distribution rights to the BoardofTrusteesofthe UniversityofFlorida and its agents. I authorize the UniversityofFlorida to digitize and distribute the dissertation describedabove for nonprofit, educational purposes via the Internet or successive technologies. This is a non-exclusive grantofpermissions for specific off-line and on-line uses for an indefinite term. Off-line uses shall be limited to those specificallyallowedby"Fair Use" as prescribedbythe termsofUnited States copyright legislation (cf, Title 17, U.S. Code) as well as to the maintenance and preservationofa digital archive copy. DigbiMliM allows the UniversityofFlorida to generate imageand text-based versions as appropriate.and.to,provide and enhance access using search software. rohibits useofthe digitim4 versivns use or profit.personal information blurredDateofSignaturePleaseprint,signandreturnto: CathleenMartyniakUF Dissertation Project PreservationDepartmentUniversityofFloridaLibrariesP.O. Box117007 Gainesville,FL32611-7007512812008
PAGE 1
REACHABLE SET CONTROLFORPREFERREDAXIS HOMING MISSILESByDONALDJ.CAUGHLIN,JR. A DISSERTATION PRESENTED TOTHEGRADUATESCHOOL OFTHEUNIVERSITYOFFLORIDA IN PARTIALFULFILLMENTOFTHEREQUIREMENTS FORTHEDEGREEOFDOCTOR OF PHILOSOPHY UNIVERSITYOFFLORIDA1988
PAGE 2
Copyright 1988ByDONALD J.CAUGHLINJR.
PAGE 3
To BarbaraAmyJon
PAGE 4
ACKNOWLEDGMENTS The author wishestoexpress his gratitude to his committee chairman, Dr. T.E Bullock, for his instruction, helpful suggestions, and encouragement. Appreciationisalso expressed for the support and many helpful commentsfromthe other committee members, Dr. Basile, Dr. Couch, Dr. Smith, and Dr. Svoronos. iv
PAGE 5
TABLE OF CONTENTS ACKNOWLEDGMENTSivLIST OF FIGURES viiKEYTO SyMBOLS ix ABSTRACTxivCHAPTER I INTRODUCTION 1 IIBACKGROUND4 Missile Dynamics 5 Linear Accelerations 6 Moment Equations (i Linear Quadratic Gaussian Control Law 7 III CONSTRAINED CONTROL13IV CONSTRAINED CONTROL WITH UNMODELED SETPOINT ANDPLANTVARIA TIONS25Linear Optimal Control with Uncertainty and Constraints31Control Technique32Discussion36Procedure.37V REACHABLE SET CONTROL EXAMPLE.41Performance Comparison Reachable SetandLQGControl....41Summary.54VI REACHABLE SET CONTROL FORPREFERREDAXIS HOMING MISSILES55Acceleration Control.56System Model.56Disturbance Model58Reference Model.60Roll Control. 62 Definition 62 Controller 66 Kalman FiIter67Reachable Set Controller68Structure 68 Application 72 v
PAGE 6
VII RESULTS AND DISCUSSION76Simulation77Trajectory Parameters78Results78Deterministic Results78Stochastic Results81Conclusions87Reachable Set Control. 87 Singer Model. 87APPENDIXA SIMULATION RESUL TS88BSAMPLED-DATA CONVERSION 94 System Model 94 Sampled Data Equations 96 System 96 Target Disturbance 98 Minimum Control Reference 99 Summary 1 00 C SAMPLED DATA COST FUNCTIONS101DLQGCONTROLLERDECOMPOSITION.107 ECONTROLLERPARAMETERS111Control Law111Filter 112 LISTOFREFERENCES 113 BIOGRAPHICAL SKETCH 117 vi
PAGE 7
LISTOFFIGURES Figure Page2.1Missile Reference System 44.1Feedback System and Notation284.2 Reachable Set Control Objective334.3 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and Symmetric Constraints.384.4 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and U nsymmetric Constraints385.1Terminal PerformanceofLinear Optimal Control...435.2 Initial AccelerationofLinear Optimal Control.435.3 Linear Optimal AccelerationvsTime455.4 Linear Optimal VelocityvsTime455.5 Linear Optimal PositionvsTime lt6 5.6 Unconstrained and Constrained Acceleration .47 5.7 Unconstrained and Constrained Velocity vs Time.485.8 Unconstrained and Constrained PositionvsTime.485.9 Acceleration Profile With and Without Target Set Uncertainty 50 5.10 VelocityvsTime With and Without Target Set Uncertainty 505.11PositionvsTime With and Without Target Set Uncertainty515.12 AccelerationvsTime LQG and Reachable Set Control. 52 vii
PAGE 8
5.13 VelocityvsTime LQG and Reachable Set Control.535.14 PositionvsTime LQG and Reachable Set Control.536.1Reachable Set Control Disturbance processes606.2. Roll Angle Error Definition from Seeker Angles636.3. Roll Control Zones 65 6.4 Target Missile System746.5 Command Generator/Tracker757.1RMS Missile Acceleration767.2 Engagement Geometry777.3 Deterministic Results807.4 Stochastic Results817.5 MeasuredvsActual Z Axis Velocity 84 7.6 Performance Using Position Estimates and Actual Velocities86A.I XY Missile&Target Positions Reachable Set Control.89A.2 XY Missile&Target Positions Baseline Control Law89A.3XZMissile&Target Positions Reachable Set Control.90AAXZMissile&Target Positions Baseline Control Law 90 A.5 Missile Acceleration Reachable Set Control...91A.6 Missile Acceleration Baseline Control Law91A.7 Missile Roll Commands&Rate Reachable Set Control.92A.8 Missile Roll Commands&Rate Baseline Control Law92A.9 Missile Roll Angle Error93viii
PAGE 9
a()B()q.)D()DODOwtDoc DocDq DO E() F() G()KEYTO SYMBOLS Reference controlinputvector. Missile inertial x axis acceleration. Target inertial x axis acceleration. Specific force (drag) along X body axis. Desired linear acceleration about Z and Y body axes. Reference controlinputmatrix. Reference stateoutputmatrix. Feedforward stateoutputmatrix. Stability parameter Equilibrium drag coefficient. Stability parameter Change in drag due to weight. Stability parameter Changeindrag due to velocity. Stability parameter Change in drag due to angleofattack. Stability parameter Change in drag due to angleofattack rate. Stability parameter Change in drag due to pitch rate. Stability parameter Change in drag due to pitch angle. Stability parameter Changeindrag due to pitch canard deflection angle. Feedforward referenceoutputmatrix. Roll angle error. System matrix describing the dynamic interaction between state variables. System controlinputmatrix. Optimal control feedback gain matrix. ix
PAGE 10
G l(ti) Optimal system state feedback gain matrix. G2(ti) Optimal target state feedback gain matrix. G3(ti) Optimal reference state feedback gain matrix. g Acceleration duetogravity. H(.) System state output matrix. Ixx,Iyy,Izz Momentofinertial withrespect to the given axis.JCosttogofunction for the mathematical optimization. L() System noise input matrix.LOStability parameter Equilibrium change in Z axis velocity.LOwt Stability parameter Change in Z axis velocity duetoweight. L u Stability parameter Change in Z axis velocity duetoforward velocity.LexLexStability parameter Change in Z axis velocity duetoangleofattack. Stability parameter Change in Z axis velocity due to angleofattack rate. Stability parameter Change in Z axis velocity duetopitch rate. Stability parameter Change in Z axis velocity duetopitch angle. Stability parameter Change in Z axis velocity duetopitch canard deflection angle. Stability parameter Equilibrium change in roll rate. Stability parameter Change in roll rate duetosideslip angle. Stability parameter Change in roll rate due to sideslip angle rate. Stability parameter Change in roll rate due to roll rate. Stability parameter Change in roll rate due to yaw rate. Stability parameter Change in roll rate due to roll canard deflection angle. Stability parameter Change in roll rate due to yaw canard deflection angle.x
PAGE 11
M Massofthemissile.MOStabilityparameter Equilibriumpitchrate. M u StabilityparameterChangeinpitchrateduetoforwardvelocity.McxStabilityparameterChange inpitchratedueto angleofattack.McxStabilityparameterChange inpitchratedueto angleofattackrate. M q StabilityparameterChange inpitchrateduetopitchrate. MOe StabilityparameterChangeinpitchrateduetopitchcanarddeflectionangle.NOStabilityparameter-Equilibriumyaw rate. N B Stabilityparameter Change in yaw ratedueto sideslip angle. N B StabilityparameterChange in yaw ratedueto sideslip angle rate. N p Stabilityparameter Change in yawratedueto roll rate. N r StabilityparameterChange in yawrateduetoyawrate. NOa StabilityparameterChangeinyawratedueto rollcanarddeflectionangle. NOr StabilityparameterChangeinyawratedueto yawcanarddeflectionangle.Nx,Ny,NzComponentsofappliedaccelerationonrespective missilebodyaxis. P Solution totheRiccatiequation.P,Q,RAngularratesabouttheX,Y,andZbodyaxis respectively.Q(.)State weightingmatrix.R(.)Controlweightingmatrix.R(.)Referencestate vector. S()State-Controlcross weighting matrix. T(.)Targetdisturbancestate vector. TgoTime-to-go.U Systeminputvector. xi
PAGE 12
V,V,W Linear velocities with respect to the X,Y,andZbodyaxis respectively.vs System noise process . Zero mean white Gaussian noise modeling uncorrelated state disturbances. Zero mean white Gaussian noise driving firstorderMarkov process modeling correlated state disturbances. IVtotlX(.)X,Y,ZYoYOwtex B Total missile velocity. System state vector. Body stabilized axis. Stability parameter Equilibrium change inYaxis velocity. Stability parameter Change inYaxis velocity due to weight. Stability parameter Change in Y axis ratedueto sideslip angle. Stability parameter Change in Y axis velocitydueto sideslip angle rate. Stability parameter ChangeinY axis velocitydueto roll rate. Stability parameter ChangeinY axis velocity due to yaw rate. Stability parameter ChangeinY axis velocity due to roll angle. Stability parameter Change in Y axis velocity due to rollcanarddeflection angle. Stability parameter Change in Y axis velocity due to yawcanarddeflection angle. Angleofattack. AngleofSideslip. System noise transition matrix. Reference state transition matrix. Target disturbance state transition matrix. System state transition matrix. xii
PAGE 13
Target model correlation time. Target elevation aspect angle. Seeker elevation gimbal angle. Target azimuth aspect angle. Seeker azimuth gimbal angle. xiii
PAGE 14
AbstractofDissertation Presented to the Graduate Schoolofthe UniversityofFlorida in Partial Fulfillmentofthe Requirements for the DegreeofDoctorofPhilosophy REACHABLE SET CONTROLFORPREFERRED AXIS HOMING MISSILESByDonald J. Caughlin, Jr. April1988Chairman: T.E. Bullock Major Department: Electrical Engineering The applicationofmodern control methodstothe guidance and controlofpreferred axis terminal homing missilesisnon-trivial in that it requires controlling a coupled, non-linear plant with severe control variable constraints, to intercept an evading target. In addition, the rangeofinitial conditionsisquite large andislimited only by the seeker geometry and aerodynamic performanceofthe missile. Thisisthe problem: Linearization will cause plant parameter errors that modify the linear trajectory. In non-trivial trajectories, both Ny andNzacceleration commands will, at some time, exceed the maximum value. The two point boundary problem is too complextocomplete in real time and other formulations are not capableofhandling plant parameter variations and control variable constraints. xiv
PAGE 15
Reachable Set Control directly adapts Linear Quadratic Gaussian (LQG) synthesis to the Preferred Axis missile,aswellasa large classofnonlinear problems where plant uncertainty and control constraints prohibit effectivefixed-final-timelinear control.Itisa robust control technique that controls a continuous system with sampled data and minimizes the effectsofmodeling errors. As a stochastic command generator/tracker, it specifies and maintains a minimum control trajectory to minimize the terminal impactoferrors generated by plant parameter (transfer function)ortarget set uncertainty while rejecting system noise and target set disturbances. Also, Reachable Set Control satisfies the Optimality Principle by insuring that saturated control,ifrequired, will occur during the initial portionofthe trajectory. With large scale dynamics determined by a dual reference in the command generator, the tracker gains can be optimized to the response timeofthe system. This separation results in an "adaptable" controller because gains are based on plant dynamics and cost while the overall systemissmoothly driven from some large displacement to a region where the relatively high gain controller remains linear. xv
PAGE 16
CHAPTER I INTRODUCTION The applicationofmodern control methods to the guidance and controlofpreferred axis terminal homing missiles has had only limited success [l,2,3]. This guidance problemisnon-trivial in thatitrequires controlling a coupled, non-linear plant with severe control variable constraints, to intercept an evading target. In addition, the rangeofinitial conditionsisquite largeandlimited only by the seeker geometry and aerodynamic performanceofthe missile. There are three major control issues that must be addressed: the couplednon-linearplantofthe Preferred Axis Missile; the severe control variable constraints; and implementation in the missile where the solutionisrequired to control trajectories lasting one(I)to two (2) seconds real time. There have been a numberofrecent advances in non-linear controlbutthese techniques have not reached the point where real time implementation in an autonomous missile controllerispractical [4,5,6]. Investigationofnon-linear techniques during this research did not improve the situation. Consequently, primarilydueto limitations imposed by real time implementation, linear suboptimal control schemes were emphasized. Bryson&Ho introduced a numberoftechniques for optimal control with inequality constraints on the control variables [7]. Eachofthese use variational techniques to generate constrained and unconstrained arcs that must be pieced together to construct the optimal trajectory.
PAGE 17
2Ingeneral, real time solutionofoptimal control problems with bounded controlisnot possible [8]. In fact, with the exceptionofspace applications, the optimal control solution has not been applied [9,10]. When Linear Quadratic Gaussian (LQG) techniques are used, the problemisnormally handled via saturated linear control, where the control is calculatedasifno constraints existed and then simply limited. This technique has been shown to be seriously deficient. In this case, neither stability nor controllability can be assured. Also, this technique can cause an otherwise initially controllable trajectory to become uncontrollable [11]. Consequently, a considerable amountoftime is spent adjusting the gainsofthe controller so that control input will remain below its maximum value. This adjustment, however, will force the controller to operate below its maximum capability [12]. Also, in the caseofthe terminal homing missile, the applicationofLQG controllers that do not violate an input constraint lead to an increasing acceleration profile and (terminally) low gain systems [13].Asa result, the performanceofthese controllersisnot desirable. Whileitisalwayspossible to tune a regulator to control the system to a given trajectory, the varianceofthe initial conditions, the time to intercept the target (normally a few seconds for a short range high performance missile), and the lackofa globally optimal trajectory due to the nonlinear nature, the best policyisto develop a suboptimal real time controller. The problemofdesigning a globally stable and controllable high performance guidance system for the preferred axis terminal homing missile is treated in this dissertation. Chapter 2 provides adequate background information on the missile guidance problem. Chapter 3 covers recent work on constrained
PAGE 18
3 control techniques. Chapters 4 and 5 discuss Robust Control and introduce "Reachable Set" Control, while Chapter 6 applies the technique to controlofapreferredaxis homing missile.Theperformanceof"Reachable Set" controlispresented in Chapter7.
PAGE 19
CHAPTER II BACKGROUND The preferred axis orientation missile has significant control input constraints and complicated coupled angular dynamics associated with the maneuvering. In the generic missile considered, the Z axis acceleration (see Figure 2.1) was structurally limited to100"g"with further limits on"g"resulting from a maximum angleofattackasa functionofdynamic pressure. Even though the Z axis was capableof100"g",the "skid-to-turn" capabilityofthe Y axis was constrained to 5"g"or less becauseofaerodynamic limitations a20:1difference. In addition to pitch (Nz) and yaw (Ny) accelerations, the missile can roll upto500 degrees per second to align the primary maneuver plane with the planeofintercept. Hence,bank-to-turn. x zFigure2.1Missile Reference System.4
PAGE 20
5The classical technique for homing missile guidanceisproportional navigation (pro nav). This technique controls the seeker gimbal angle ratetozero which (given constant velocity) causes the missiletofly a straight line trajectory toward the target [14,15]. In the late 70's aneffortwas made to use modern control theory to improve guidance laws forair-to-airmissiles.Forrecent research on this problem see, for example, [11].Asstated in the introduction, these efforts have not significantly improved the performanceofthe preferred axis homing missile.Ofthe modern techniques, two basic methodologies have emerged: one was a body-axis oriented control law that used singular perturbation techniques to uncouple the pitch&roll axis [16,17]. This technique assumed that roll rateisthe fast variable, an assumption that may not be true during the terminal phaseofan intercept. The second techniquewasan inertial point mass formulationthat controls inertial accelerations [18]. The acceleration commands are fixed with respecttothe missile body; but, since these commands can be related to the inertial reference via the Euler Angles, the solution is straight forward. Bothofthese methods have usually assumed unlimited control available and the inertial technique has relied on the autopilottocontrol the missile roll angle, and therefore attitude,tode rotate from the inertial to body axis. Missile Dynamics The actual missile dynamics are a coupled setofnonlinear forces and moments resolved along the (rotating) body axesofthe missile [19]. Linearizationofthe equations about a "steady state"ortrim condition,
PAGE 21
6neglectinghigherorderterms, resultsinthefollowingsetofequations (usingstandardnotation,see symbol keyinthe preface):ex=Q-PB+ Azb / IV totl B =-R-Pex+Ayb/ IVtotlLinearAccelerationsIU=RV-QW-{DO+DOwt }MIV=PW -RU+-{yo+ YOwtl M+ Yf3B+YaB+ YpP+YrR+ Y0 0 + Yoaoa + Yoror IW=QU-PV +-{LO+LOwt} MMomentEquations Q=MO/I yy +M u U+Mexex+Mexex+MqQ+ Moeoe (1)(2)(3) (4)(5) (6)+-----IyyIyy
PAGE 22
7 (7)+(8)+Linear Quadratic Gaussian Control Law For allofthe modern development models, a variationofa fixed-final-time LQG controller was usedtoshape the trajectory. Also, it was expected that the autopilot would realize the commanded acceleration. First, consider the effectofthe unequal body axis constraints. Assume that100"g"was commanded in each axis resulting in an acceleration vector45degrees from Nz.IfNyisonly capableof5"g",the resultant vector will be 42 degrees in error, an error that will havetobe corrected by succeeding guidance commands. Evenifthe missile has the timeorcapability to complete a successful intercept, the trajectory can not be considered optimal. Now consider the nonlinear natureofthe dynamics. The inertial linear systemisaccurately modeledasa double integratorofthe acceleration to determine position. However, the acceleration commandisa functionofthe missile state, equation(1),and therefore, itisnot possible to arbitrarily assign the input acceleration. And, given a body axis linear acceleration, the inertial
PAGE 23
8component will be severely modified by the rotation (especially roll)ofthe reference frame. Allofthese effects are neglectedinthe linearization. This thenisthe problem: In the intercept trajectories worth discussing, Ny, Nz, and roll acceleration commands will,atsome time, saturate. Highorder,linear approximations do not adequately model the effectsofnonlinear dynamics, and the complete two point boundary value problem with control input dynamics and constraintsistoo difficult to complete in real time. Although stochastic models are discussed in Bryson and Ho [7], and a specific techniqueisintroduced by Fiske [18], the general procedure has been to use filtered estimates and adynamic-programming-likedefinitionofoptimality (using the PrincipleofOptimality) with Assumed Certainty Equivalence to find control policies [20,21,22]. Therefore, allofthe controllers actually designed for the preferred axis missiles are deterministic laws cascaded with a Kalman Filter. The baseline forouranalysisisan advanced control law proposed by Fiske [18]. Given the finite dimensional linear system: where andxx(t)=Fx(t)+Gu(t)xyz VxVyVz(9)
PAGE 24
9with the cost functional:Jtfl=xfPfxf+ 1 uTRudrtoR=I(10)Applicationofthe Maximum principle results in a linear optimal control law:=-----3(Tgo) 31 +(Tgo)3+3(Tgo)2 31 +(Tgo)3(11)Coordinates usedforthis system are "relative inertial."Theorientationofthe inertial systemisestablished at the launch point.Thedistances and velocities are the relative measures between the missileandthe target. Consequently, the set pointiszero, with the reference frame moving with the missile similartoa "moving earth" reference used in navigation. Since Fisk's control law was based on a point mass model, the control law did not explicitly control the roll angle PHI (0). Theroll angle was controlledbyabank-to-turnautopilot [23]. Therefore, the guidance problemwasdecomposed into two components, trajectory formationandcontrol. The autopilot attempted to control the rollsothat thepreferredaxis (the-Zaxis) was directed toward the planeofintercept.Theautopilot used to control the missile was designed to use proportional navigationandisa classical combinationofsingle loop systems. Recently, WilliamsandFriedland have developed a newbank-to-turnautopilot based on modern state space methods [24]. Inorderto accurately control the banking maneuver, the missile dynamics are augmented to include the kinematic relations describing the change in the commanded specific force
PAGE 25
10vectorwithbankangle.Todeterminethe actual anglethroughwhichthe vehicle must roll,definethe roll angle error:Ayb e", =tan-l{--} Azb (12) Using thestandardrelationsforthe derivativeofavectorina rotatingreferenceframe,the following relationships followfromthe assumptionthatA11AlB:Azb=-P(Ayb)Ayb=+P(Azb) (13) (14)Theangle e", represents theerrorbetween the actualanddesiredroll angleofthe missile.Differentiating e", yields: (Azb)(Ayb) (Ayb)(Azb) e", =(Azb)2+(AYb)2which,aftersubstituting componentsofAxw, showsthat(15) (16) Simplifying the nonlinear dynamicsof(1) (8), the following model was used:oe=Q-PJ3+Azb / IVtotl 13 =-R-Poe+Ayb /IVtotl(Izz -Ixx ) Q=Moeoe+MqQ+ MSeSe +-----IyyPR(17) (18) (19)
PAGE 26
wherePQ(20)(21)(22)(23)U sing this model directly, the autopilot would be designedasan eighth-order system with time-varying coefficients. However, even though these equations contain bilinear terms involving the roll rate Paswellaspitch/yaw cross-coupling terms, the roll dynamics alone, represent a second order system thatisindependentofpitch and yaw. Therefore, using an "Adiabatic Approximation" where the optimal solutionofthe time-varying systemisapproximated by a sequenceofsolutionsofthe time-invariant algebraic Riccati equation for the optimum control law at each instantoftime, the model was separated into roll and pitch/yaw subsystems [25]. Now, similartoa singular perturbations technique, the functionofthe roll channelisto provide the necessary orientationofthe missilesothat the specific force acceleration lies on the Z (preferred) axisofthe missile. Using this approximation, the systemisassumed to be in steady state, and all coefficients--including rollrate--areassumed to be constant. Linear Quadratic Gaussian (LQG) synthesisisused, with an algebraic Riccati equation, on a second and sixth order system. And, when necessary, the gains are scheduledasa functionofthe flight condition.
PAGE 27
12While still simplified, this formulation differs significantly from previous controllers in two respects. First, the autopilot explicitly controls the roll angle; and second, the pitch and yaw dynamics are coupled. Even though preliminary work with this controller demonstrated improved tracking performance by the autopilot, overall missile performance, measured by miss distance and time to intercept, did not improve. However, the autopilot still relies on a trajectory generated by the baseline controller ( e.g. Azb in 17). Consequently, the missile performance problemisnot in the autopilot, the error sourceisin the linear optimal control law whichforms the trajectory. "Reachable Set Control"isa LQG formulation that can minimize these errors.
PAGE 28
CHAPTER III CONSTRAINED CONTROL In Chapters I and II,wecovered the non-linear plant, the dynamics neglected in the linearization, the impactofcontrol variable constraints, and the inabilityofimproved autopilots to reduce the terminal error. To solve this problem,wemust consider the optimal controlofsystems subject to input constraints. Although a searchofthe constrained control literature did not provide any suitable technique for real time implementation, someofthe underlying concepts were used in the formulationof"Reachable Set Control." This Chapter reviews someofthese results to focus on the constrained control problem and illustrate the concepts. Muchofthe early work was based on research reported by Tufts and Shnidman [26] which justified the useofsaturated linear control. However,asstated in the introduction, with saturated linear control, controllabilityisnot assured.Ifthe system, boundary values and final time are such that thereisno solution with any allowable control(Ifthe trajectoryisnot controllable), then the boundary condition will not be met by either azero terminal errororpenalty function controller. While constrained control can be studied in a classical way by searching for the effectofthe constraint on the valueofthe performance function, this procedureisnot suitable for real time controlofa system with a wide rangeofinitial conditions [27]. Someofthe techniques that could be implemented in real time are outlined below.13
PAGE 29
14Lim used a linearized gain to reduce the problemtoa parameter optimization [8]. Given the system model: x=Fx+Gu+Lw (1) with state x, constant F,G,and L, scaler control u, and Lw representing zero mean Gaussian white noise with covariance LLT. Consider the problemofchoosing a feedback law such that in steady state, assuming it exists, the expected quadratic costItfJ=E{lim [ (x(t)TQx(t)+ >.u(t)2) dt+X(tf)Tp(tf)X(tf)] }tf-00 to (2)isminimized. The weighting matrix Qisassumed to be positive semidefinite and >. O.Dynamic programming leadstoBellman's equation: min { t tr[LTYxx(x)L)+(Fx+Gu)Tyx(x)+xTQx+ >.u2 }= 0:.* and, assuming a Y(x) satisfying(3),the optimal solution u(x)=SAT {(1/2>')G T y x (x) }=SGN {GTyx(x)}(3)(4)However,(3)cannot be solved analytically, andYx in generalisa nonlinear functionofx. Consider a modified problem by assuming a controlofthe form: u(x)=SAT { gT x }=SGN{gTx } where gisa constant (free) vector. >'=0 (5a)(5b)Assume further that xisGaussian with known covarianceW(positive definite). Using statistical linearization, a linearized gain k can be obtained by minimizing E{u(x) -kTx}2(6)
PAGE 30
15which results in for (Sa): for (Sb): where 4>(z) = (2MtJ;xp (-ty 2)dyk= (2/7l")t .(gTWg)-t.g(7a)(7b)From (1), with u determined by kT x, the stable covariance matrixWandsteady statePareand(F+GkT)W+W(F+GkT)T+LLT=0 (F+GkT)Tp+P(F+GkT )+P+ >.kkT =0(8)(9)andThe solution to (3), without the minimum, isv(x)=xTpxex=tr{LT PL } (10)(11) The problemisto choose g suchthatthe expected costexby statistical linearizationisa minimum. However, a minimum may not exist. In fact, from [8], a minimum does not exist when the noise disturbance is large. Since we are considering robust control problems with plant uncertainty or significant modeling errors, the noise will be large and the minimum will be replaced by a greatest lower bound.Asexapproached the greatest lower bound, the control approached bang-bang operation. A combinationofplant errorsandthe rapid dynamicsofsome systems (such as the preferred axis missile) would preclude acceptable performance withbang-bangcontrol.
PAGE 31
16Frankenaand Sivan suggested a criterionthatreduce thetwo-pointboundary problem to an initial value problem [12]. They suggest controlling the plant while minimizing this performance index: With the constraint lIu(t)IIR(t) 1 Applying the maximum principle to the Hamiltonian developed from(12)x(t)=F(t)x(t)+G(t)u(t) with x(to)=xOprovides the adjoint differential equationto
PAGE 32
17ForGT Px:;: 0 andIIxliS :;: 0, P will be the solutionofPGR-IGTpS P+PF+=Q+---FTPIIR I GTpxIlRIIxliS(18)Now choosing S=PGR-IGTpresults in a Lyapunovequationandwill insure negativedefiniteP(t)ifFisa stability matrix.Therefore,withthis choiceofweighting functions totransformthe problem to a singleboundarycondition, a stable F matrixisrequired. Thisisa significant restrictionandnot applicable to the systemunderconsideration.Gutmanand Hagander developed a designforsaturatedlinear controllersforsystems with controlconstraints[9,28].Thedesign beginswitha low-gain stabilizing control, solves a Lyapunov equation tofinda regionofstability and associated stability matrix, andthensums the controlsina saturation function toformthe constrained control. Begin with the stabilizable continuous linear time invariant system x=Fx+Guwith admissible controlinputsui,suchthatx(O)=xO=l,...,m(19)where gi and hi are the control constraints. ConsiderannxmmatrixL == [11I12I...I1 m ] suchthatisa stability matrix.(20)(21)
PAGE 33
18Associated with eachofthe controlsaresetsthatdefineallowable conditions.Theset Disthe setofinitial conditionsfromwhichitisdesired to stabilize the system to the origin.Thelow gain stabilizing control LdefinesthesetE:E == E(L) == (zIzER(22)i=I,...,mwhichis the setofstatesatwhichthestabilizing linearfeedbackdoesnotinitiallyexceedthe constraints.Anotherset is F: F == F(L) ==n {(eFct)-1E}tE[O,oo)(23)(24) Fisa subsetofE suchthatalong all trajectories emanatingfromF,thestabilizing linear state feedback doesnotexceed the constraint.Theregionofstabilityforthe solutionofthe Lyapunovequationisdefinedby 0== O(L,P,c) == {xIxTPx c}whereV(x)=xTpxis the Lyapunovfunctioncandidateforthe stabilitymatrixFc,andc is to be determined.Thecontrol technique follows: Step1:DetermineD. Step2:FindLbysolving aLQGproblem.Thecontrol penalty is increaseduntilthecontrolLTx satisfies theconstraintin(19)forxinD.IfDissuchthatthecontrolconstraintcan notbesatisfied,thenthis designisnotappropriate.Step3:FindPandc.FirstfindaP=PT>0suchthattheLyapunovequationPFc+F cTp>O.Now determine 0 bychoosing cin(24) suchthatD 0 E:supxT Px c minxT PxxEDxE&E(25)
PAGE 34
19where 6E designates the boundaryofE.Ifthis fails, choose another P, or select a "lower gain" in order to enlarge E,orfinally, a reduction in the sizeofD might be considered. Step4:Set up the control according to u= SAT[ (L T -KGTp)x]where Kisdefined(26)K=[ki0]ki 0,i= 1,2, ... ,m (27)okmand tune the parameters ki by simulations. A sufficient condition for the algorithm to workisD E.(28)Unfortunately, determining the stability region was trial and error; and, once found, further tuningofa diagonal gain matrixisrequired. In essence, thiswasa technique for determining a switching surface between a saturated and linear control. Also, when the techniquewasapplied to an actual problem, inadequacies in the linear model were not compensated for. Given the nonlinear natureofthe preferred axis missile, rangeofinitial conditions, and the trial and error tuning required for eachofthese conditions, the procedure would not be adequate for preferred axis terminal homing missile control. A notable featureofthe control scheme, however, was the ability to maintain a stable system with a saturated control during muchofthe initial portionofthe trajectory. Another technique for control with bounded input was proposed by Spong et al. [29]. This procedure used an optimal decision strategytodevelop a pointwise optimal control that minimized the deviation between the actual and
PAGE 35
20desired vectorofjointaccelerations, subject toinputconstraints. The computationofthe control law is reduced to the solutionofa weighted quadratic programming problem. Key to this solution is the availabilityofa desired trajectory in state space. Suppose that a dynamical system can be described by with which can be writtenasx(t)=f(x(t+G(x(tu(t) c (29) Fix time t 0, let s(t,xo,to,u(t (or s(t) for short), denote the solution to (29) corresponding to the giveninputfunction u(t). At time t,ds/dtisthe velocity vectorofthe system, andisgiven explicitly by the right hand sideof(29). Define the set C t=C(s(t with C(s(t=(ex(t,w)ERNIex=f(s(t+G(s(tw,wE {} } {} ={wI NW:5 C}(30)Therefore, for each t andany allowable u(t),ds/dtliesinthe set Ct. In other words, the set C t contains the allowable velocitiesofthe solution s(t). Assume that there exists a desired trajectory yd, and an associated vector field v( t)=v(s(t),yd(t),t, which is the desired (state) velocityofs(t) to attain yd. Consider the following "optimal decision strategy" for a given positive definite matrixQ:Choose theinputu(t)sothat the corresponding solution s(t) satisfies(d/dt)s(t,u(t=s*(t), where s*(t)ischosenateach t to minimize min {(ex v(s(t),yd(t),tTQ(exv(s(t),yd(t),t }exECt(31)
PAGE 36
21Thisisequivalenttothe minimization min { !UTGTQGu -(GTQ(v-fTu } subject to, Nu(t) cu(32)Wemay now solve the quadratic programming problemtoyield a pointwise optimal control law for (29). At each time t, the optimal decision strategy attemptsto"align" the closed loop system with the desired velocity v(t)asnearlyaspossible in a least squares sense. In this way the authors retain the desirable propertiesofv(t) within the constraints imposed by the control. Reachable Set Control builds on this technique:itwill determine the desired trajectory and optimally track it. Finally, minimum-time control to the origin using a constrained acceleration has also been solved by a transformationtoa two-dimensionalun-constrained control problem[30].Byusing a trigonometric transformation, the controlisdefined by an angular variable, u(t) f{cos(I3),sin(I3)}, and the control problem was modifiedtothe controlofthis angle. The constrained linear problemisconvertedtoan unconstrained nonlinear problem that forces a numerical solution. This approach removes the effectofthe constraintsatthe expenseofthe continuous applicationofthe maximum control. Given the aerodynamic performance (range and velocity) penaltyofmaximumcontrol and the impact on attainable roll rates due to reduced stability at high angleofattack, this concept did not fit preferred axis homing missiles.Animportant assumption in the previous techniques was that the constrained systemwascontrollable. In fact, unlike (unconstrained) linear systems, controllability becomes a functionofthe set admissible controls, the initial state, the time-to-go, and the target state. To illustrate this, someofthe relevant points from [31,32] will be presented. An admissible controlisone
PAGE 37
22 that satisfies the condition u() :[0,00) 0 ERm where 0 is the control restraint set. The collectionofall admissible controls will be denoted by M(O). The target set Xisa specified subset in Rn.Asystem is defined to be O-controllable from an initial statex(tO)= xo to the target set X at Tifthere existsU()EM(O)such that x(T,u(),xO) EX.Asystem would be globally O-controllable to XifitisO-controllable toXfromeveryx(tO)ERn.Inordertopresent the necessary and sufficient conditions for O-controllability, consider the following system: x(t)=F(t)x(t)+G(t,u(tand the adjoint defined by: =F(t)T z(t) with the state transition matrix q>(t,r) and solution x(to) =xOZ(to) =zO(33)t f [0,00)(34)(35)The interior B and surface Softhe unit ball in Rn are defined asB={ZO f RnS={zOfRn llzoll I}IIzoll=I}(36) (37)The scaler functionJ(.):Rn xRxRnxRn-R is defined by J(xo,t,x,zo) =XOTZO+Jtmax [GT(r,w)z(r)]dr-x(t)Tz(t)o WfO (38)Given the relatively mild assumptionsof[32], a necessary conditionfor(33) to be O-controllable to X fromx(tO)ismax min J(xO,T,x,ZO) =0 XfXzOfB while a sufficient conditionissup min J(xO,T,x,zO)>0 XfXZOfS (39) (40)
PAGE 38
23The principle behind the conditions arises from the definitionofthe adjoint system--Z(t). Using reciprocity, the adjointisformed by reversing the roleofthe input and output, and running the system in reverse time [33]. Consider x(t)=F(t)x(t)+G(t)u(t) y(t)=H(t)x(t) and: z(t)=-F(t)Tz(t)+ o(t)=GT(t)z(t) Therefore x(to)=xOz(to)=zO(41) (42)and zT(t)x(t)=zT(F(t)x(t)+G(t)u(t(d/dt)(zT(t)x(t= T(t)x(t)+z T(t)x'(t)= +zT(t)G(t)u(t)(43) (44)Integrating both sides from tototfyields the adjoint lemma:(45)The adjoint defined in (31) does not have an input. Consequently, the integral in (35)isa measureofthe effectofthe control applied to the original system.Bysearching for the maximumGT(r,w)z(r),it provides the boundaryoftheeffectofallowable control on the system (33). Restricting the search over the target settothe min ( J(xo,t,x,zO) :t f [O,T],zo f S}ormin ( J(xo,t,x,zo) :t f [O,T],zo f B} minimizes the effectofthe specific selectionof Z() on the reachable set and insures that the search is over a function thatisjointly continuous in (t,x). Consequently, (35) compares the autonomous growthofthe system, the reachable boundaryofthe allowable input, and the desired target set and time. Therefore,ifJ=0, the adjoint lemmaisbe
PAGE 39
24identically satisfied at the boundaryofthe control constraint set (necessary); J>0 guarantees that a control can be found to satisfy the lemma.Ifthe lemmaissatisfied, then the initial and final conditions are connected by an allowable trajectory. The authors [32]goon to develop a zero terminal error steering control for conditions where the target setisclosed and max min J(xO,T,x,zO) 0 xXZOS (46)But their control technique has two shortcomings: First; it requires the selectionofz00The initial conditionzoisnot specified but limited toIIz01l=1.A particularzomust be selected to meet the prescribed conditions and the equality in (43 ) for a given boundary condition, andistherefore not suitable for real time applications. And second; the steering control searched M(O) for the supremumofJ, making the control laws bang-bang in nature, again not suitable for homing missile control. While a direct searchof Ox isnot appropriate for a preferred axis missile steering control, a "dual" system, similar to the adjoint system used in the formulationofthe controllability function J, can be usedtodetermine the amountofcontrol requiredtomaintain controllability. Once controllabilityisassured, then a cost function that penalizes the state deviation (as opposed to a zero terminal error controller) can be used to control the systemtoan arbitrarily small distance from the reference.
PAGE 40
CHAPTERIV CONSTRAINEDCONTROLWITHUNMODELEDSETPOINTANDPLANTVARIAnONSChapterIII reviewed anumberoftechniques tocontrolsystemssubjectto control variable constraints. While noneofthe techniques werejudgedadequateforreal time implementationofapreferred-axishoming missile controller, someofthe underlying conceptscanbe used to develop atechniquethatcanfunctionin the presenceofcontrol constraints:(l)Useofa "dual system"thatcanbe used to maintain a controllable system (trajectory); (2) an "optimal decision strategy" to minimize the deviation betweentheactualanddesiredtrajectorygenerated by the "dual system;"and(3) initiallysaturatedcontrolandoptimal (realtime) selectionofthe switchingsurfacetolinearcontrolwithzero terminalerror.However, inadditionto,andcompounding the limitations imposedbycontrol constraints, wemustalso consider the sensitivityofthecontrol to unmodeled disturbancesandrobustnessunderplant variations.Inthe stochastic problem, there are threemajorsourcesofplantvariations.First,therewill be modeling errors (linearization/reductions)thatwill causethedynamicsofthe system to evolve in adifferentor"perturbed" fashion. Second,theremay betheunmodeleduncertaintyinthe system statedueto Gaussian assumptions.Andfinally,inthe fixed final time problem, there may beerrorsinthe final time, especiallyifitisafunctionoftheuncertain stateorimpactedbythe modeling reductions. Since theprimaryobjectiveofthis researchisthe zeroerrorcontrolofa dynamical systeminfixed time, mostofthemore recent 25
PAGE 41
26optimization techniques (eg. LQG/LTR,WXl) did not apply. At this time, these techniques seemedtobe more attuned to loop shapingorrobust stabilization questions. A fundamental proposition that forms the basisofReachable Set Controlisthat excessive terminal errors encountered when using an optimal feedback control for an initially controllable trajectory (a controllable system that can meet the boundary conditions with allowable control values) are caused by the combinationofcontrol constraints and uncertainty (errors) in the target set stemming from unmodeled plant perturbations (modeling errors)orset point dynamics. First, a distinction must be made between a feedback and closed-loop controller. Feedback controlisdefinedasa control system with real-time measurement data fed back from the actual system but no knowledgeofthe form, precision, or even the existenceoffuture measurements. Closed-loop control exploits the knowledge that the loop will remain closed throughout the future intervaltothe final time.Itaddstothe information provided to a feedback controller, anticipates that measurements will be taken in the future, and allows prior assessmentofthe impactoffuture measurements.IfCertainty Equivalence applies, the feedback lawisa closed-loop law.Underthe Linear Quadratic Gaussian (LQG) assumptions, thereisnothing to be gained by anticipating future measurements. In the mathematical optimization, external disturbances can be rejected, and the mean valueofthe terminal error can be made arbitrarily close to zero by a suitable choiceofcontrol cost. For the following discussion, the "system" consistsofa controllable plant and an uncontrollable referenceortarget. The system stateisthe relative difference between the plant state and reference. Since changes in the system
PAGE 42
27boundary condition can be caused by either a change in the reference point or plant output perturbations similartothose discussed in Chapter II, some definitions are necessary. The setofboundary conditions for the combined plant and target system, allowing for unmodeled plant and reference perturbations, will be referredtoasthe target set. Changes, or potential for change, in the target set caused only by target (reference) dynamics will be referredtoasvariations in the set point. The magnitudeofthese changesisassumed to be bounded. Admissible plant controls are restrictedtoa control restraint set that limits the input vector. Since there are bounds on the input control, the system becomes non-linear in nature, and each trajectory must be evaluated for controllability. Assume that the system (trajectory)ispointwise controllable from the initialtothe boundary condition. Before characterizing the effectsofplant and set point variations, we must consider the formofthe plant and it's perturbations.Ifweassume that the plantisnonlinear and time-varying, thereisnot much that can be deduced about the target set perturbations. However,ifhave a reduced order linear modelofa combined linear and nonlinear process,ora reasonable linearizationofa nonlinear model, then the plant can be consideredaslinear and time-varying. For example, in the caseofa Euclidean trajectory. the system model (a double integrator)isexact and linear. Usually, neglected higher order or nonlinear dynamics or constraints modify the accelerations and lead to trajectory (plant) perturbations. Consequently, in this case, the plant can be accurately representedasa Linear Time Invariant System with (possibly) time varying perturbations.
PAGE 43
28Consider the feedback interconnectionofthe systems K and P where Kisa sampled-data dynamic controller and P the (continuous) controlled system:r-K uG--p_ .: Figure4.1Feedback System and Notation Assuming that the feedback systemiswell defined and Bounded Input Bounded Output (BIBO) stable, at any sample time ti, the system can be defined in termsofthe following functions: e(ti)=r(ti) y(ti) u( ti)=Ke( ti)y(ti)=Pu( ti)(1) (2)(3)
PAGE 44
with the operator G29G[K,P]asthe operator that maps the input e(ti)tothe output y(ti) [34]. At any time, the effectofa plant perturbation DoP can alsobecharacterizedasa perturbation in the target set. or thenIfP=Po + DoP P= P(I+DoP) y(ti)= YO(tj) + Doy(ti) (4a)(4b)(5)where Doy(ti) represents the deviation from the "nominal" output caused by either the additive or multiplicative plant perturbation. Therefore, e(ti)=r(ti) (YO(ti)+ Doy(ti = (r(tj) + Doy(ti -YO(ti)= Dor(ti) -YO(ti)(6)(7)(8)with Dor(ti) representing a change in the target set that was unknown to the controller. These changes are then fed backtothe controller but could be handled a priori in a closed loop controller designastarget set uncertainty. Now consider theeffectofconstraints.Ifthe controlisnot constrained, and target set errors are generated by plant variationsortarget maneuvers, the feedback controller can recover from these intermediate target set errors by using large (impulsive) terminal controls. The modeled problem remains linear. While the trajectoryisnot the optimal closed-loop trajectory, the trajectoryisoptimal based on the model and information set available. Even with unmodeled control variable constraints, and a significant dis placementofthe initial condition, an exact plant model allows the linear stochastic optimal controllertogenerate an optimal trajectory. The switching time from saturated to linear controlisproperly (automatically) determined and,
PAGE 45
30asin the linear case, the resulting linear control will drive the statetowithin an arbitrarily small distance from the estimateofthe boundary condition.Ifthecontrol constraint set covers the rangeofinputs required by the control law, the law will always be able to accommodate target set errors in the remaining time-to-go. This is, in effect, the unconstrained case. If, however, the cost-to-goishigherand/orthe deviation from the boundary conditionisofsufficient magnitude relative to the time remainingtorequire inputs outside the boundaryofthe control constraint set, the system will not follow the trajectory assumed by the system model.Ifthisisthe caseastime-to-goapproaches zero, the boundary condition will not be met, the systemisnot controllable (to the boundary condition).Astime-to-go decreases, the effectsofthe constraints become more important. With control input constraints, and intermediate target set errors caused by unmodeled target maneuvers or plant variations, it may not be possible for the linear control lawtorecover from the midcourse errors by relying on large terminal control. In this case, an optimal trajectoryisnot generated by the feedback controller, and, at the final time, the systemisleft with large terminal errors. Consequently,ifexternal disturbances are adequately modeled, terminal errors that are ordersofmagnitude larger than predicted by the open loop optimal control are caused by the combinationofcontrol constraints and target set uncertainty.
PAGE 46
31Linear Optimal Control with Uncertainty and Constraints An optimal solution must meet the boundary conditions. To accomplish this, plant perturbations and constraints must be considered a priori. They should be includedasa priori information in the system model, they must be physically realizable, and they must be deterministic functionsofa priori information, past controls, current measurements, and the accuracyoffuture measurements. From the control pointofview,wehave seen that the effectofplant parameter errors and set point dynamics can be groupedastarget set uncertainty. This uncertainty can cause a terminally increasing acceleration profile even when an optimal feedback control calls for a decreasing input (see Chapter 5). With the increasing acceleration caused by midcourse target set uncertainty, the most significant terminal limitation becomes the control input constraints. (These constraints not only affect controllability, they also limit how quickly the system can recover from errors.)Ifthe initial controlissaturated while the terminal portion linear, the controlisstill optimal.Ifthe final controlisgoingtobesaturated, however, the controller must account for this saturation. The controller could anticipate the saturation and correct the linear portionofthe trajectorytomeet the final boundary condition. This control, however, requires a closed formsolution for x(t), carries an increased cost for an unrealized constraint, and is known to be valid for monotonic ( single switching time) trajectories only [11]. Another technique availableisLQG synthesis. However, LQG assumes controllability in minimizing a quadratic cost to balance thecontrol error and
PAGE 47
32input magnitudes.Aswehave seen, the effectsofplant parameter and reference variations, combined with control variable constraints, can adversely impact controllability. The challengeofLQGisthe proper formulationofthe problem to function with control variable constraints while compensating for unmodeled set point and plant variations. Reachable Set Control uses LQG synthesis and overcomes the limitationsofan anticipative controltoinsure a controllable trajectory. Control Technique Reachable Set Control can be thoughtofasa fundamentally different robust control technique based on the concepts outlined above. The usual discussionofrobust feedback control (stabilization) centers on the developmentofcontrollers that function even in the presenceofplant variations. Using either a frequency domain or state space approach, and modeling the uncertain ty, bounds on the allowable plant or perturbations are developed that guarantee stability [35]. These bounds are determined for the specific plant under consideration and a controller is designedsothat expected plant variations are contained within the stability bounds. Building on ideas presented above, however, this same problem can be approached in an entirely different way. This new approach begins with the same assumptionsasstandard techniques, specifically a controllable system and trajectory. But, with Reachable Set Control,wewill not attempt to model the plant or parameter uncertainty, nor the set point variation.Wewill, instead, reformulate the problemsothat the system remains controllable, and thus stable, throughout the trajectory even in the presenceofplant perturbations and severe control input constraints.
PAGE 48
33Before we developanimplementable technique, considerthedesired resultofReachable Set Control (and the originofthe name) by using atwo-dimensional missile intercept problemasan example.Attime t= t}. not any specific time during the intercept, the targetisatsome location T1andthe missileisatMIasshown in Figure 4.2. Consider these locationsasoriginsoftwo independent, target and missile centered, reference systems.Fromthese initial locations, given the control inputs available, reachable sets for each system can be definedasafunctionoftime (not shown explicitly).Thetarget setiscircular because is maneuver directionisunknownbutits capabilitybounded,and the missile reachable set exponential because the x axis controlisconstant and uncontrollable while the z axis accelerationissymmetricandbounded.TheobjectiveofReachable Set Controlisto maintain the reachable targetsetin the interiorofthe missile reachable set. Hence, Reachable Set Control.xTargetReachableM1zMissileReachableSetFigure 4.2 Reachable Set Control Objective
PAGE 49
34As stated, Reachable Set Control would bedifficultto implementasa control strategy. Fortunately, however,furtheranalysis leads to a simple, direct, and optimal technique thatisvoidofcomplicated algorithms orad-hocprocedures. First, consider the process. The problem addressed is the controloffixed-terminal-timesystems. The true cost is the displacementofthe state at the final time and only at the final time. In the terminal homing missile problem, thisisthe closest approach, or miss distance. In another problem, it may be fuel remaining at the final time, or possibly a combinationofthe two. In essence, with respect to the direct applicationofthis technique, thereisno preferenceforone trajectory over another or no intermediate cost based on the displacementofthe state from the boundary condition. The term "direct application" was used because constrained path trajectories, suchasthose required by robotics, or the infinite horizon problem, like the controlofthe depthofa submarine can be addressed by separating the problem into several distinctintervals--eachwith a fixed terminaltime--ora switching surface when the initial objective is met [36]. Given a plant with dynamics x(t)=f(x,t)+g(u(w),t) y(t)=h(x(t),t) modeled by x(t)=F(t)x(t)+G(t)u(t) yx(t)=H(t)x(t) x(to)=xo(9) (10)
PAGE 50
35with final condition x(tf) and a compact control restraint setOx.Let Ox denote the setofcontrols u(t) for which u(t)E Ox for tE [0,00).The reachable set X(tO,tf,xO'Ox) == (x:x(tf)=solutionto(10) withxofor some u()EM(Ox) }isthe setofall states reachable fromxoin time tf. In addition to the plant and model in(9&10),wedefine the reference(11)r(t)=a(x,t)+b(a(w),t) y(t)=c(x(t),t) modeled by r(t)=A(t)r(t)+B(t)a(t)Yr(t)=C(t)r(t) and similarly defined set R(to,tf,ro,Or), r(to)=rO(12) (13) R(to,tf,ro,Or) == (r:r(tf)=solution to (13) withrOfor some a()EM(Or) }asthe setofall reference states reachable fromrOin time tf. (14) Associated with the plant and reference, at every time t,isthe following system: e(t)=yx(t) -Yr(t)(15) ;;m (10&13), we see that yx(t) andYr(t)are output functions that incorporate the significant characteristicsofthe plant and reference that will be controlled. The design objectiveise(tf)=0(16)and we want to maximize the probabilityofsuccess and minimize the effectoferrors generated by the deviationofthe reference and plant from their associated models. To accomplish this with a sampled-data feedback control law,
PAGE 51
36wewill select the control u(ti) such that, at the next sample time (ti+l), the target reachable set will be covered by the plant reachable set and, in steady state,ife(tf)=0, the control will not change. Discussion Recalling that the performance objective at the final timeisthe real measureofeffectiveness, and assuming that the terminal performanceisdirectly related to target set uncertainty, this uncertainty should be reduced with time-to-go. Now consider the trajectory remembering that the plant model is approximate (linearized or reduced order), and that the reference has the capability to change and possibly counter the control input. (This maneuverabili ty does not havetobe taken in the contextofa differential game.Itis only intended to allow for unknown set point dynamics.) During the initial portionofthe trajectory, the target set uncertaintyisthe highest. First, at this point, the unknown (future) reference changes have the capabilityofthe largest displacement. Second, the plant distance from the uncertain set pointisthe greatest and errors in the plant model will generate the largest target set errors becauseofthe autonomous response and the magnitudeofthe control inputs required to move the plant state to the set point. Along the trajectory, the contributionofthe target (reference) maneuvera bility to set point uncertainty will diminish with time. This statement assumes that the target (reference) capability to change does not increase faster than the appropriate integralofits' input variable. Regardlessofthe initial maneu verabilityofthe target, the time remainingisdecreasing, and consequently, the
PAGE 52
37ability to move the set point decreases. Target motionissmaller and it's positionismore and more certain. Selectionofthe control inputs in the initial stagesofthe trajectory that will result in a steady state control (that contains the target reachable set within the plant reachable set) reduces target set uncertainty by establishing the plant operating point and defining the effective plant transfer function. At this point,wedo not have a control procedure, only the motivation to keep the target set within the reachable setofthe plant along with a desire to attain steady state performance during the initial stagesofthe trajectory. The specific objectives are to minimize target set uncertainty, and most importantly, to maintain a controllable trajectory. The overall objectiveisbetter performance in termsofterminal errors. Procedure A workable control law that meets the objectives can be deduced from Figure 4.3. Herewehave the same reachable setforthe uncertain target, but this time, several missile origins are placed at the extremesoftarget motion. From these origins, the systemisrun backward from the final timetothe current time using control values from the boundaryofthe control constraint set to provide a unique setofstates that are controllable to the specific origin.Ifthe intersectionofthese setsisnon-empty, any potential target locationisreachable from this intersection. Figure 4.4 is similar, but this time the missile control restraint setisnot symmetric. Figure 4.4 shows a case where the missile acceleration controlisconstrainedtothe set A=[Amin,Amax] where 0 Amax(17)
PAGE 53
z38xTargetReachable AlltargetpositionsReachableFigure 4.3 IntersectionofMissile Reachable Sets BasedonUncertain Target Motion and Symmetric ConstraintsxTargetReachableSetAllTargetLocationsReachableFigure 4.4 IntersectionofMissile Reachable Sets Based on Uncertain Target Motion and Unsymmetric Constraints
PAGE 54
39Since controllabilityisassumed, which for constrained control includes the control bounds and the time interval, the extreme left and right (near and far) pointsofthe set point are included in the set drawn from the origin. To implement the technique, construct a dual system that incorporates functional constraints, uncontrollable modes, and uses a suitable control value from the control constraint setasthe input. From the highest probability target positionatthe final time, run the dual system backward in time from the final boundary condition. Regulate the plant (system) to the trajectory defined by the dual system. In this way, the fixed-final-time zero terminal error controlisaccomplished by re-formulating the problemasoptimal regulationtothe dual trajectory. In general, potential structuresofthe constraint set preclude a specific point (origin, center, etc.) from always being the proper inputtothe dual system. Regulation to a "dual" trajectory from the current target position will insure that the originofthe target reachable set remains within the reachable setofthe plant. Selectionofa suitable interior point from the control restraint setasinputto the dual system will insure that the plant has sufficient control power to prevent the target reachable set from escaping from the interiorofthe plant reachable set. Based on unmodeled set point uncertainty, symmetric control constraints, and a double integrator for the plant, a locus exists that will keep the target in the centerofthe missile reachable set.Ifthe set pointisnot changed, this trajectory can be maintained without additional inputs. For a symmetric control restraint set, especiallyasthe time-to-go approaches zero, Reachable Set Controliscontrol to a "coasting" (null control) trajectory.
PAGE 55
40Ifthe control constraints are not symmetric, such asFigure4.4, a locusofpoints that maintains the target in thecenterofthe reachable set is the trajectory based on the systemrunbackward from the final time target location with the acceleration command equal to the midpointofthe set A. Pictured in Figures 4.2 to 4.4 were trajectories that are representativeofthe double integrator. Other plant models would havedifferenttrajectories. Reachable Set Control is a simple techniqueforminimizing the effectsoftarget set uncertainty and improving terminal the performanceofa large classofsystems.Wecan minimize the effectsofmodeling errors (or target setuncertainty) by a linear optimal regulator that controls the system to a steady state control. Given the well known and desirable characteristicsofLQGsynthesis, this technique canbeusedasthe basisforcontrol to the desired "steady state control" trajectory.Thetechnique handles constraints by insuringaninitially constrained trajectory. Also, since the large scale dynamics are controlled by the "dual" reference trajectory, the tracking problem be optimized to the response timeofthe systemunderconsideration. This results inan"adaptable" controller because gains are basedonplant dynamicsandcost while the overall system is smoothlydrivenfrom some large displacement to a region where the relatively high gainLQGcontroller will remain linear.
PAGE 56
CHAPTERV REACHABLE SETCONTROLEXAMPLEPerformance Comparison Reachable SetandLOGControlInorderto demonstrate theperformanceof"Reachable Set Control" we will contrast its performance with theperformanceofa linear optimal controllerwhenthereistarget setuncertaintycombinedwithinputconstraints. Consider,forexample, thefinitedimensional linear system:withthequadratic cost x(to)=xo(1)whereJ1=-XfTpfXf 2(2)andtfE [0,00) '1 0Applicationofmaximum principle yields the following linear optimal control law:where1u=+-x(tf)(t-tf) '1 xo+xo tfx(tf)=1+41(3) (4)
PAGE 57
42Appropriately defining t, to, andtf,the control law can be equivalently expressed in an open loop or feedback form with the latter incorporating the usual disturbance rejection properties. The optimal control will tradeoff the costofthe integrated square input with the final error penalty. Consequently, even in the absenceofconstraints, the terminal performanceofthecontrolisa functionofthe initial displacement, time allowed to drive the statetozero, and the weighting factor ,. To illustrate this, Figure5.1presents the terminal states (miss distance and velocity)ofthe linear optimal controller. This plotisa compositeoftrajectories with different run times ranging from 0to3.0 seconds. The figure presents the valuesofposition and velocity at the final time t=tfthat result from an initial positionof1000 feet and with velocityof1000feet/sec with =104 Figure 5.2 depicts,asa functionofthe run time, the initial acceleration (at t=0.) associated with eachofthe trajectories shown in Figure 5.1. From these two plots, the impactofshort run timesisevident: the miss distance will be higher, and the initial acceleration command will be greater. Since future set point (target) motionisunknown, the suboptimal feedback controllerisreset at each sample time to accommodate this motion. The word resetissignificant. The optimal controlisa functionofthe initial condition at time t=to, time, and the final time. A feedback realization becomes a functionofthe initial condition and time to go only. In this case, set point motion (target set uncertainty) can place the controller in a position where the time-to-goissmall but the state deviationislarge.
PAGE 58
Velocity2000o-2000-4000 -6000 -8000-10000 -12000-14000o20043400600800Position10001200Figure5.1Terminal PerformanceofLinear Optimal ControlAccelerationo-100000--200000-300000v-400000o0.51Final Time1.52Figure 5.2 Initial AccelerationofLinear Optimal Control
PAGE 59
44While short control times will result in poorer performance and higher accelerations, it does not take a long run time to drive the terminal error to near zero. Also, from (4)wesee that the terminal error can be driventoan arbitrarily small value by selectionofthe control weighting. Figure5.1presented the final valuesoftrajectories running from 0 to 3 seconds. Figures 5.3 through 5.5 are plotsofthe trajectory parameters for the two second trajectory (with the same initial conditions) along with the zero control trajectory values. These values are determined by starting at the boundary conditionsofthe optimal control trajectory and running the system backward with zero acceleration. For example,ifwestart at the final velocity and run backwards in time along the optimal trajectory, for each point in time, thereisa velocity (the null control velocity) that will take the corresponding positionofthe optimal control trajectorytothe boundary without additional input. The null control position begins at the origin at the final time, and moving backward in time,isthe position that will take the system to the boundary condition at the current velocity. Therefore, these are the positions and velocities (respectively) that will result in the boundary condition without additional input.Ast=>tfthe optimal trajectory acceleration approaches zero. Therefore, the zero control trajectory converges to the linear optimal trajectory.Ifthe system has a symmetric control constraint set, Reachable Set Control will control the system position to the zero control (constant velocity) trajectory.
PAGE 60
Accelerationo-500-1000-1500 -2000-250000.5451Time1 .52Figure 5.3 Linear Optimal Acceleration vs Time r--Null Control VelocityVelocity1500 10005000-500........-1000 -15000.50..............1Time1.52Figure 5.4 Linear Optimal Velocity vs Time
PAGE 61
46Position20001000......o-1000-2000-3000o NullControlPosition0.511.5TimeFigure 5.5 Linear Optimal PositionvsTime2Consider now the same problemwithinputconstraints. Since Vet)is a linear functionoftime and the final state,itismonotonicandthe constrained optimal controlisIu=SAT(-x(tf)(t-tf(5) "I In this case, controllability is in question,andisa functionofthe initial conditionsandthetime-to-go.Assuming controllability, the final state will be given by: x(tf)=xQ+ xQtf a(tl)SGN(x(tf)[tf-(tl/2)](tf-t1)31+---(6)
PAGE 62
47wheretlisthe switching time from saturated to linear control. The open loop switch time can be shown to be(7)or the closed loop control can be used directly. In either case, the optimal control will correctly control the system to a final state X(tf) near zero. Figures 5.6 through 5.8 illustrate the impactofthe constraint on the closed loop optimal control. In each plot, the optimal constrained and unconstrained trajectoryisshown.Acceleration1000o-1000-2000-3000 -4000.............-500000.51Time1 .52Figure 5.6 Unconstrained and Constrained Acceleration
PAGE 63
48Velocity2000o-2000-4000Constrained o0.51Time1 .52Figure 5.7 Unconstrained and Constrained VelocityvsTime ...... ...........21 .51Time0.5Constrained Position------l Position250020001500 10005000 0Figure 5.8 Unconstrained and Constrained Position vs Time
PAGE 64
49Now consider the effectsoftarget set uncertainty on the deterministic optimal control by using the same control law for a 2.0 second trajectory where the boundary conditionisnot constantbutchanges. The reason for the target uncertainty and selectionofthe boundary condition can be seen by analyzing the componentsofthe modeled system. Assume that system actually consistsofan uncontrollable reference (target) plant as wellascontrolled (missile) plant with the geometry modeled by the difference in their states. Therefore, the final set point (relative distance)iszero,butthe boundary condition along the controlled (missile) trajectoryisthe predicted target position at the final time. This predicted position at the final timeisthe boundary condition for the controlled plant. Figures 5.9 through5.11are plotsoflinear optimal trajectories using the control law in (5,6). There are two trajectoriesineach plot. The boundary condition for one trajectoryisfixed at zero, the set point for the other trajectory is the pointwise zero control value (predicted target state at the final time). Figures 5.9 through5.11demonstrate the impactofthis uncertainty on the linear optimal control law by comparing the uncertain constrained control with the constrained control that has a constant boundary condition.
PAGE 65
Acceleration1000o-1000 -2000 -3000 -400050......UncertainTarget Set -500000.51Time1 .52Velocity2000o-2000 -4000Figure 5.9 Acceleration ProfileWithand Without Target Set Uncertainty TargetSet \ .......................o0.51Time1 .52Figure 5.10 Velocity vs TimeWithand Without Target Set Uncertainty
PAGE 66
51PositionUncertainTargetSet ---------1400 1200 1000800600400200o00.51Time1 .52Figure5.11PositionvsTime With and Without Target Set Uncertainty When thereistarget set uncertainty, simulated by the varying set point, the initial accelerationisinsufficient to prevent saturation during the terminal phase. Consequently, the boundary conditionisnot met. The final setofplots, Figures 5.12 through 5.14, contrast the performanceofthe optimal LQG closed loop controller that we have been discussing and the Reachable Set Control technique.Inthese trajectories, the final set pointiszerobutthereistarget set uncertainty again simulatedbya time varying boundary condition (predicted target position) that convergestozero. Although properly shownasa fixed final time controller, the Reachable Set Control results in Figures 5.12 through 5.14 are from a simple steady state (fixed gain) optimal tracker referencedtothe zero control trajectory r.
PAGE 67
52Thesystem modelforeach techniqueis(8)withx(to)=(x-r)oThelinear optimal controller has a quadratic costofJ=2Thereachable set controller minimizes(9)J= J : 1)[(x-r)TQ(x-r)+u(r)Tu(r)]dr(10)And,ineithercase, the value for r(t)isthe position that will meet theboundaryconditionatthe final time withoutfurtherinput.Acceleration1000o-1000 -2000 -3000-4000ReachableSetControl -500000.51Time1 .52Figure 5.12 Acceleration vsTimeLQGand Reachable Set Control
PAGE 68
Velocity2000o-2000 -400053.......................................................-__-_. "'-ReachableSetControlo0.51 Time1 .5 2".'.ReachableSetControl Position140012001000800600400200o0Figure 5.13 Velocity vs Time LQG and Reachable Set Control"..............0.51TimeFigure 5.14 Position vs Time LQG and Reachable Set Control1 .5'"2
PAGE 69
54Summary The improved performanceofReachable Set Controlisobvious from Figure 5.14. While demonstrated for a specific plant, and symmetric control constraint set, Reachable Set Controliscapableofimproving the terminal performanceofa large classofsystems.Itminimized the effectsofmodeling errors (or target set uncertainty) by regulating the system to the zero control state. The technique handled constraints and insured an initially constrained trajectory. The tracking problem could be optimized to the response timeofthe system under considerationbysmoothly driving the system from some large displacementtoa region where the relatively high gain LQG controller remained linear.
PAGE 70
CHAPTER VI REACHABLE SET CONTROL FOR PREFERRED AXIS HOMING MISSILES As statedinChapter II, the most promising techniques that can extend the inertial point mass formulation are based on singular perturbations [37,38,39]. When applied to the preferred axis missile, eachofthese techniques leads to a controller that is optimal in some sense. However, a discussionof"optimality" notwithstanding, the best homing missile intercept trajectory is the one that arrives at the final "control point" with the highest probabilityofhitting the target. This probability can be broken down into autonomous and forced events.Ifnothing is changed, what is the probabilityofa hitorwhat is the miss distance?Ifthe target does not maneuver, can additional control inputs result in a hit? And, in the worse case,ifthe target maneuvers (or an estimation error iscorrected) will the missile have adequate maneuverability to correct the trajectory? Noneofthe nonlinear techniques based on singular perturbations attempt to control uncertaintyoraddress the terminally constrained trajectories caused by increasing acceleration profile. Unfortunately, an increasing acceleration profile has been observed in allofthe preferred axis homing missile controllers. In many cases, the genericbank-to-turnmissileof[11,18] was on all three constraints (Ny,Nz,P) during the latter portionofthe trajectory.Ifthe evading targetisable toputthe missileinthis position without approaching it's own maneuver limits,itwill not be possible for the missile to counter the final evasive maneuver. The missileisno longer controllable to the target set. The "standard" solution to the increasing acceleration profileisa varyingcontrol cost.55However, without additional
PAGE 71
56 additional modifications, this typeofsolution results in a trajectorydependentcontrol. As we have seen, Reachable Set Control is anLQG controlimplementationthatmoves the system to the point wherefurtherinputs are not required. A Reachable Set Controller that willrejecttarget and system disturbances,cansatisfy both the mathematicalandheuristic optimality requirementsbyminimizing the cost yet maintaining a controllable system. Since the roll control hasdifferentcharacteristics, the discussionofthepreferredaxis homing missile controller using the Reachable Set Control technique will be separated into translationalandroll subsystems.Thetranslational subsystem has a suitable null control trajectorydefinedbythe initial velocity and uncontrollable acceleration provided by the rocket motor.Theroll subsystem, however, is significantlydifferent.Inorderforthepreferredaxis missile to function, thepreferredaxis must be properly aligned. Consequently, both roll angleerrorandroll rate should be zero at all times. In this case, the null control trajectory collapses to the origin. Acceleration Control System Model Since we want to control the relative target-missile inertial system to the zero state, the controller will bedefinedinthis reference frame. Eachofthe individual system states aredefined(in relative coordinates) as target state minus missile state. Begin with the deterministic system: x(t)=Fx(t)+Gu(t)(I)
PAGE 72
57where andxyzx=Vx Vy Vzu= Since the autopilot modelisa linear approximation and the inertial model assumes instantaneous response, modeling errors will randomly affect the trajectory. Atmospheric and other external influences will disturb the system. Also, the determinationofthe state will requirethe useofnoisy measurements. Consequently, the missile intercept problem should be approached via a stochastic optimal control law. Because the Reachable Set Control technique will minimize the effectofplant parameter variations (modeling errors) and un modeled target maneuverstomaintain controllability, we can use an LQG controller. Assuming Certainty Equivalence, this controller consistsofan optimal linear (Kalman) filter cascaded with the optimal feedback gain matrixofthe corresponding deterministic optimal control problem. Disturbances and modeling errors can be accounted for by suitably extending the system description[40]:x(t)=F(t)x(t)+G(t)u(t)+Vs(t) by adding a noise process V s (") to the dynamics equations with(2) (3)
PAGE 73
58Therefore, let the continuous time state description be formally given by the linear stochastic differential equationx(t)= ct>( t, to)x( to) dx(t)=F(t)x(t)dt+G(t)u(t)dt+L(t)df3(t) (with13(,.)a Wiener process) that has the solution: ct>( t,r)G(r)u(r)dr+
PAGE 74
state to converge with zero error.59Since the optimal solution to the linear stochasticdifferentialequation is a Gauss-Markov process, time correlated processescanbe includedbyaugmenting the system state to include the disturbance process.Letthe time-correlated target (position) disturbance be modeled by the following: T(t)=N(t)T(t)+Wt(t)(8)with[Tx(t)IT(t)=Ty(t) Tz(t) and =Wt(t) While the target disturbance resulting from anunknownacceleration is localized to a single plane with respect to the body axisofthe target, the target orientation is unknown to the inertial model. Consequently, following the methodologyofthe Singer Model, each axis will be treated equally [41]. Since the disturbance is firstorderMarkov, it's components will be:andN(t)=-(I/Tc)[I] Wt(t)=(20't2 / T c)[I](9) (10)where Tcisthe correlation time, and O'r isthe RMS valueofthe disturbance process.ThePower Spectral Densityofthe disturbanceis: Wtt(W) =20't2 / T cw2+O/Tc)2(11) Figure 6.1 summarizes the noise interactions with the system.
PAGE 75
60WhiteGaussianNoiseJ-------'AtmosphericdisturbancesActuatorErrorsAutopilotErrors"DeterministicControls LinearSystem OutputsPhysicalModelSystemResponse Jl TargetAccelerationsShapingFilterWhite Gaussian NoiseFigure6.1Reachable Set Control Disturbance processes.Withappropriate dimensions, the nine state (linear) augmented system model becomes:[ ]T(t)=Reference Model Reachable Set Control requires a supervisory steering control (reference) that includes the environmental impact on the controlled dynamic system. Recalling the characteristicsofthe dual system, one was developed that explicitlyran(I) backward in time after determining the terminal conditions. However, in developing this control for this preferred axis missile a numberoffactors actually simplify the computationofthe reference trajectory:
PAGE 76
61(I)Thecontrol constraint setforthispreferredaxis missile is symmetric.Consequently.thereferencetrajectoryforaninterceptcondition.is a null control (coasting)trajectory.(2)Thebodyaxis X acceleration isprovidedbythemissile motor,andisnotcontrollablebutknown. This uncontrollableaccelerationwillcontributetothetotal inertial acceleration vector.mustbeconsideredbythe controller.andistheonlyacceleration present onanintercept(coasting)trajectory.(3)Theterminationoftheinterceptistheclosestapproach.whichnowbecomes thefixed-final-time(tf).Thetime-to-go(tgo) isdefinedwith respect tothecurrenttime (t) by: t=tftgo(13)(4)Thefinalboundaryconditionforthesystem state (target minus missile) is zero.Insummary,theinterceptpositions are zero.theinitial velocity is given.andthe average acceleration is a constant.Therefore.itissufficientto reversethedirectionoftheinitial velocityandaverageaccelerationthenrunthesystemforwardintimefortgo secondsfromtheorigintodeterminethecurrentpositionofthecoasting trajectory. Let:withandr(t)=A(t)r(t)+B(t)a(t)r(O)=rO=0A(t)=F(t)andB(t)=G(t)(14)thenr(t) isthepointfromwhich theautonomoussystem dynamics will takethesystem todesiredboundarycondition. Becauseofthe disturbances. targetmotion,andmodelingerrors,futurecontrolinputsarerandomvectors.Therefore,thebest policy isnotto
PAGE 77
62determine the input over the control period [to,tfl a priori buttoreconsider the situation at each instant t on the basisofall available information. At each update,ifthe systemiscontrollable, the reference (and system state) will approach zeroastgo approaches zero. Since the objectiveofthe controlleristo drive the system state to zero,wedo not require a tracker that will maintain the control variable at a desired non-zero value with zero steady state error in the presenceofunmodeled constant disturbances. There are disturbances, but the final set pointiszero, and therefore, aPIcontrollerisnot required. Roll Control Definition The roll modeismost significant sourceofmodeling errors in the preferred axis homing missile. While non-linear and high order dynamics associated with the equationsofmotion, autopilot, and control actuators are neglected, the double integratorisan exact modelfordetermining inertial position from inertial accelerations. The linear system, however,isreferenced with respect to the body axis. Consequently, to analyze the complete dynamics, the angular relationship between the body axis and inertial references must be considered. Recall Friedland's linearized (simplified) equations. The angular relationships determine the orientationofthe body axis reference and the roll rate appeared in the dynamicsofall angular relationships. Yet,tosolve the system using linear techniques, the system must be uncoupled via a steady state (Adiabatic) assumption. Also, the roll angleisinertially defined and the effectofthe linear accelerations on the erroristotally neglected.
PAGE 78
63Froma geometric pointofview, this mode controls the rangeofthe orthogonal linear acceleration commandsandthe constrained controllabilityofthe trajectory. With a20:I ratio in thepitchandyaw accelerations, the ability topointthepreferredaxis in the "proper" direction is absolutely critical. Consequently, effective roll control is essential to the performanceofthepreferredaxis homing missile.Thefirst problem in defining the roll controller, is the determinationofthe "proper" direction. There are two choices.Thepreferredaxis could be aligned with the target positionorthedirectionofthe commanded acceleration.Thefirstselectionisthe easiest to implement.Theseeker gimbal angles provide adirectmeasureofintercept geometry (Figure 6.2), and the roll angleerrorisdefineddirectly:(15)TargetLOS eyzxFigure 6.2. Roll AngleErrorDefinitionfromSeeker Angles. This selection, however,isnot the most robust. Depending on the initial geometry, the intercept point may not beinthe planedefinedbythecurrent
PAGE 79
64lineofsight (LOS) and longitudinal axisofthe missile. In this case, the missile must continually adjust its orientation (roll) to maintain the target in the preferred plane.Asrange decreases the angular rates increase, with the very real possibilityofsaturation and poor terminal performance. Consequently, the second definitionofroll angle error should be used. Considering the dynamicsofthe intercept, however, aligning the preferred axis with the commanded acceleration vectorisnotasstraightforwardasit seems. Defining the roll angle erroras 0e =Tan-I{A y /A z } (16) leadstosignificant difficulties. From the previous discussion, itisobvious that roll angle errors must be minimizedsothat the preferred axis acceleration canbeusedtocontrol the intercept. The roll controller must have a high gain. Assume, for example, the missileison the intercept trajectory. Therefore, both A y and A z will be zero. Now,ifthe target moves slightly in theYbdirection and the missile maneuvers to correct the deviation, the roll angle error instan taneously becomes 90 degrees. High gain roll control inputs to correct this situation are counter productive. The small A y may be adequate to completely correct the situation before the roll mode can respond. Now, the combinationoflinear and roll control leads to instabilityasthe unnecessary roll rate generates errors in future linear accelerations. The problems resulting from the definitionofequation16can be overcomebere-examining the roll angle error. First, A y and A z combine to generate a resultant vector at an angle from the preferred axis. In the processofapplying constraints, the acceleration angle that results from the linear accelerations canbeincreased or decreased by the presenceofthe constraint.Ifthe angleisdecreased, the additional rollisneeded to line up the preferred axis and the
PAGE 80
65desired acceleration vector.Ifthe constrained (actual) acceleration angleisincreased beyond the (unconstrained) desired value by the unsymmetric actionofthe constraints, then the roll controller must allow for this "over control" caused by the constraints. Define the roll angle errorasthe difference between the actual and desired acceleration vectors after the control constraints are considered. This definition allows for the full skid toturncapabilityofthe missile in accelerat ing toward the intercept point and limits rolling to correct large deviations in acceleration angle from the preferred axis that are generated by small accelera tions. ReferringtoFigure 6.3, three zones can be associated with the following definitions: "ec =Tan-I(Ay/Az } 'lea =Tan-I(Ny/Nz } "er = "ec 'lea;ZoneIZoneIIZoneIIIFigure 6.3. Roll Control Zones.(17) (18) (19)
PAGE 81
66Here Ny and N z are the constrained acceleration values. In Zone I, "er =O.The linear acceleration can complete the intercept without further roll angle change. Thisisthe desired locus for the roll controller. Both A y and A z are limited in Zone II. Thisisthe typical situation for the initial positionofa demanding intercept. The objectiveofthe roll controlleristokeep the intercept acceleration outofZone III where only A yislimited. In this case, the A y accelerationisinsufficient to complete the intercept yet significant roll angle change may be requiredtomake the trajectory controllable. Controller A dual mode roll controllerwasdeveloped to accommodate the rangeofsituations and minimize roll angle error. Zone I requires a lower gain controller that will stabilize the roll rate and maintain "er small. Zones II and III require high gain controllers. To keep Zone II trajectories from entering Zone III, the "ec will be controlled to zero rather than the roll angle error. Since the linear control valueisalso a functionofthe roll angle error, roll angle errors are determined by comparing the desired and actual anglesofa fixed high gain reachable set controller.Ifthe actual linear commands are used and a linear accelerationissmall becauseoflarge roll angle errors, the actual amountofroll needed to line up the preferred axis and the intercept point, beyond the capabilityofthe linear accelerations, will not be available because they have been limited by the existing roll angle error that must be corrected. Unlike the inertial motions, the linear model for the roll controller accounts for the (roll) damping and recognizesthatthe inputisa roll rate change command: = -w" +wP(20)
PAGE 82
67Therefore, the roll mode elements (that will be incorporated into the model are) are: (21) Also, the dual mode controller will require an output function and weighting matrix that includes both roll angle and roll rate. Kalman Filter The augmented system model (13)isnot block diagonal. Consequently, the augmented system filter will not decouple into two independent system and reference filters. Rather, a single, higher order filter was required to generate the state and disturbance estimates. A target model (the Singer model)wasselected and modified to trackma-neuvering targets from a Bank-To-Turnmissile [41,42]. Using this model, a continuous-discrete Extended Kalman Filter was developed. The filter used a9 state target model for the relative motion (target missile): (22) with u(t) the known missile acceleration, N the correlation coefficient, and Wt(t) an assumed Gaussian white noise input with zero mean. Azimuth, elevation, range, and range rate measurements were available from passive JR, semi-active, analog radar, and digitally processed radar sensors. The four measurements are seeker azimuth (,p), seeker elevation(0),range (r), and range rate (dr/dt):
PAGE 83
68 6 =-Tan-l{z(x2 +y2)-1/2} t/J =+Tan-l{y/x}= 1r +Tan-l{y/x}r={x2+y2+z 2}-1/2 r= x 0x 0(23)Noise statistics for the measurements are a functionofrange, and are designed to simulate glintandscintillation in a relatively inexpensive missile seeker. In contrast to the linear optimal filter, theorderofthe measurements for the extended filterisimportant. In this simulation, the elevation angle (6) was processed first, followed by azimuth (t/J), range (r), and range rate(dr/dt).In addition, optimal estimates were available from the fusionofthe detailed (digital) radar model and IR seeker. Reachable Set Controller StructureTheTarget-Missile System is showninFigure 6.4. The combinationofthe augmented system state and the dual referencethatgenerates the minimum control trajectory for the reachable set concept is best describedasa CommandGenerator/Trackerandisshown in Figure 6.5.Ina single systemofequations the controller models the system response, including time correlated position disturbances, and provides the reference trajectory. Since only noise-corrupted measurementsofthe controlled system are available, optimal estimatesofthe actual states were used. Becauseofthe processing time requiredforthe filter and delays in the autopilot response, a continuous-discrete Extended Kalman Filter,anda sampled
PAGE 84
69data(discrete) controller was used. This controller incorporated discrete cross-coupling terms to control the deviations between the sampling timesaswell the capability to handle non-coincident sampleandcontrol intervals (Appendices B andC).Combining the linear and roll subsystems with a firstorderroll mode for the roll angle state, the model for the preferred axis homing missile becomes: the reference: r(t)=A(t)r(t)+B(t)a(t) with the tracking error: e(t)=[yx(t) Yr(t) ]=[H(t)I0I-C(t)] r(t)(25)(26)The initial stateismodeled asann-dimensional Gaussian random variable with meanxOand covariancePO.E{ws(t)ws(t)T}=Ws(t) is the strengthofthe system (white noise) disturbances to be rejected,andE{wt(t)Wt(t)T}=Wt(t)isaninput to a stationary firstorderGauss-Markov processthatmodels target acceleration. The positions are the primary variablesofinterest,andthe output matrices will select these terms. Along with the roll rate, these are the variables that will be penalized by the control costandthe states where disturbances will directly impact the performanceofthe system.
PAGE 85
In block form, with appropriate dimensions, the system matrices are:F(t)=A(t)=F=[0I]OOwG(t)=B(t)=G=[0]Iw(28)H(t)=C(t)=[Ihw ]N(t)=-(ljTc)[I]L(t)=IM(t)=Iwhere the Ow, I w and h w terms are required to specify the roll axis system and control terms:
PAGE 86
71{-wi=j=8{1i=j=8 (Ow>ij = (hw>ij =0otherwise0otherwise{+wi=8,j=4 (Iw)ij = I otherwise The performance objective for the LQG synthesisisto minimize an appropriate continuous-time quadratic cost: Js(t) = E{Jd(t)II(t)}(29)where J sisthe stochastic cost, I(t)isthe information set available at time t, and Jd a deterministic cost function:(30)Dividing the intervalofinterest into N+I intervals for discrete time control, and summing the integral cost generates the following (see AppendixC):(31)which can be related to the augmented state X=[xTr ]T by:][ X(tj) u(ti)](32)
PAGE 87
72In general, with the cost terms defined for the augmented state (AppendixC),the optimal (discrete) solution to the LQG tracker can be expressedas:where*=-[G(ti)] [ x(ti) T(ti) r(ti)](33)and G*(ti)=[R(ti) + GT(ti)P(ti+l)G(ti) ] -1 [ + ST(tj) ] P(ti)=Q(ti) + -[+ ST(ti) ]TG*(ti)(34) (35)Since only the positions (and roll rate) are penalized, the Riccati recursionisquite sparse. Consequently, by partitioning the gain and Riccati equations, and explicitly carrying out the matrix operations, considerable computationalim-provements are possible over the straightforward implementationofa19by19tracker (Appendix D). Application The tracking error and control costs were determined from the steady state tracker used in the example in Chapter5.First, missile seeker and aerodynamic limitations were analyzed to determine the most demanding intercept attainable by the simulated hardware. Then, autopilot delays were incorporated to estimate that amountoftime that a saturated control would require to turn the missile after correcting a 90 degree (limit case) roll angle error. The steady state regulator was used to interactively place the closed loop polesandselect a control cost combination that generated non zero control for the desired lengthoftime. These same values were used in the time varying Reachable Set Controller with the full up autopilot simulation to determine the
PAGE 88
73terminal error cost and control delay time. To maintain a basisofcomparison, the Kalman Filter parameters were not modified for this controller. Appendix E contains initial conditions for the controller and estimator dynamics. During the initialization sequence (safety delay) for a given run, time varying fixed-final-time LQG regulator gains are calculated (via 36) based on the initial estimateofthe time togo.Both high and low roll control gains were computed. These solutions used the complete Riccati recursion and cost based on the sampled data system, included a penalty on the final state (to control transient behaviorastgo approaches zero), and allowed for non-coincident sample and control. Given an estimated tgo, at each time t, the Command GeneratorITracker computed the reference position and required roll angle that leads to an intercept without additional control input. The highorlow roll control gain was selected based on the mode. Then the precomputed gains (that are a functionoftgo) are used with the state and correlated disturbance estimate from the filter, roll control zone, and the reference r to generate the control (whichisapplied only to the missile system). Becauseofsymmetry, the tracker gain for the state term equaled the reference gain,sothat, in effect, except for the correlated noise, the current difference between the state x and the reference r determined the control value. During the intercept, between sample times when the stateisextrapolated by the filter dynamics, tgo was calculated based on this new extrapolation and appropriate gains used. This technique demonstrated better performance than using a constant control value over the durationofthe sample interval and justified the computational penalty of the continuous discrete implementationofthe controller and filter.
PAGE 89
MeasurementNoise InterceptGeometry--Sensors--TargetControl--Disturbances TargetDynamics--OutputFunction+ l' Command -ZeroOrder GeneratorI .. Hold -DIATrackerStateEstimationSample&Hold AIDSensorsReachableSetControllerContinuous-DiscreteKalmanFilterMeasurementNoise Autopilot--Actuators Missile MissileDynamics-OutputFunctionDynamicDisturbancesFigure 6.4 Target Missile System
PAGE 90
r-Reference ModelReferenceVariableDynamicsa....Missile Model Yr(t)e(t) ,----,LJ-DynamicDisturbancesMeasurementNoise ", G1G3u(ti)I---{ "')-------1 G2Missile -TargetSystem X(t)Q!LJi Z(t)Yx(t)X(ti)TargetManeuverModelA(ti)ContinuousDiscreteKalmanFilter-Figure 6.5 Command Generator / Tracker
PAGE 91
CHAPTER VII RESULTS AND DISCUSSIONAsan additional reference, before comparing the resultsofReachable Set Control to the baseline control, consider anair-to-airmissile problem from [13]. In this example,the launch directionisalong the lineofsight, the missile velocityisconstant, and the autopilot response to commandsisinstantaneous. The controller has noisy measurementsoftarget angular location, a priori knowledgeofthe time to go, and stochastically models the target maneuver. Even with this relatively simple problem, the acceleration profile increases sharply near the final time. Unfortunately, this acceleration profileistypical, and has been observed in all previous optimal control laws. Reachable Set Control fixes this problem. c.9-...oa:)_u'"u u0oen--....t= .EenE0=: 300 200100o6Time to go (sec) O'--....L------l-.---l 12Figure7.1RMS Missile Acceleration76
PAGE 92
77Simulation The performanceofReachable Set Control was determined via a highfi-delity Bank-To-Turn simulation developed at the UniversityofFlorida and used for a numberofprevious evaluations. The simulationisbased on the coupled non-linear missile dynamicsofchapter II equations (1)to(8) andisa continuous-discrete system that has the capabilityofcomparing control laws and estimators at any sample time. In addition to the non-linear aerodynamic parameters, the simulation models the Rockwell Bank-To-Turn autopilot, sensor (seeker and accelerometer) dynamics, has a non-standard atmosphere, and mass modelofthe missile to calculate time-varying momentsofinertia and the missile specific acceleration from the time varying rocket motor. Figure 7.2 presents the engagement geometry and someofthe variables used to define the initial conditions.Missiley,ZTarget G ax D ..... U9 ;./\j! 9Figure 7.2 Engagement Geometry
PAGE 93
78The simulated targetisa three (3) dimensional, nine (9)"g"maneuvering target. Initially, the target trajectoryisa straight line. Once the range from the missile to targetisless than 6000 feet, the target initiates an instantaneous 9"g"evasive maneuver in a plane determined by the target roll angle, an input parameter.Ifthe launch rangeiswithin 6000 feet,the evasive maneuver begins immediately. Thereisa.4second "safety" delay between missile launch and autopilot control authority. Trajectory Parameters The performanceofthe control laws was measured with and without sensor noise using continuous and sampled data measurements. The integration step was .005 seconds and the measurement step for the Extended Kalman Filter was.05seconds. The trajectory presented for comparison has an initial offset angleof40 degrees (tPg) and180degree aspect (tPa), and a target rollof90 degrees away from the missile. This angleoffand target maneuverisoneofthe most demanding intercept for a preferred axis missile since it must roll through 90 degrees before the preferred axisisaligned with the target. Other intercepts were run with different conditions and target maneuvers to verify the robustnessofReachable Set Control and the miss distances were similar or less that this trajectory. Results Deterministic Results These results are the best comparisonofcontrol concepts since both Linear Optimal Control and Reachable Set Control are based on assumedCertainty Equivalence.
PAGE 94
79Representative deterministic results are presented in Table7.1and Figure 7.3. Figures A.l throughA.9present relevant parameters for the 4000 foot deterministic trajectories. Table7.1Deterministic Control Law Performance Initial Control Time Miss Range Distance (feet) (sec) (feet) 5500 Baseline 2.34 8 Reachable 2.34 6 5000 Baseline2.2113Reachable2.21104800 Baseline 2.1715Reachable 2.17 4 4600 Baseline 2.13 29 Reachable 2.13 6 4400 Baseline 2.0638Reachable 2.08 7 4200 Baseline 2.0235Reachable 2.05 5 4000 Baseline1.9854Reachable 2.00133900 Baseline1.9843Reachable1.988 3800 Baseline1.9840 Reachable1.988 3700 Baseline 2.02 44 Reachable 1.99103600 Baseline 1.99136Reachable 1.9965
PAGE 95
80MissDistance(feet) BaselineGuidanceLaw ............./ SetControl140 f-120 100 80 f-60-40 20 e-j500 40004500Initial Range50005500Figure7.3Deterministic Results An analysisoftrajectory parameters revealed that oneofthe major performance limitations was the Rockwell autopilot. Designed for proportional navigation with noisy (analog) seeker angle rates, the self adaptive loops in the autopilot penalized a high gain control law suchasReachable Set Control. This penalty prevented Reachable Set Control from demonstrating quicker intercepts and periodic control that were seen with a perfect autopilot on a similar simulation used during the research. However, even with the autopilot penalty, Reachable Set Controlwasabletosignificantly improve missile performance near the inner launchboundary. This verifies the theoretical analysis, since thisisthe region where the target set errors, control constraints, and short run times affect the linear law most significantly.
PAGE 96
81Stochastic Results Stochastic performance was determined by 100 runsateach initial condition.Atthe terminationoftherun,the miss distanceandTimeofFlight (TOF) was recorded. During eachofthese runs, the estimatorandseeker (noise)errorsequences were tracked. Both sequences were analyzed to insure gaussian seeker noise, and an unbiased estimator (with respect to each axis).Fromthe final performance data, the meanandvarianceofthe miss distance was calculated. Also, from the estimator and seeker sequences, the root mean square (RMS)errorand varianceforeach run was determined toidentifysome general characteristicsofthe process. The averageofthese numbersispresented. Care must be taken in interpreting these numbers. Since the measurement errorisafunctionofthe trajectoryaswell as instantaneoustrajectoryparameters, a singlenumberisnot adequate to completely describe the total process. Table 7.2andFigure 7.4 present average results using the guidance laws with noisy measurementsandthe Kalman filter.MissDistance(feet)250 200150 100BaselineGuidanceLaw..,..' ., .....50 ReachableSetControl S500 40004500Initial Range50005500Figure 7.4 Stochastic Results
PAGE 97
82 Table 7.2 Stochastic Control Law Performance Initial Control Time Miss Distance RMS Error Range Mean VarianceEKFSeeker (feet) (sec) (feet) (feet)(deg) 5500 Baseline 2.38 273 11470111.3Reachable2.41832677111.55000 Baseline 2.23 240 7874111.4Reachable 2.25 89 3937101.64800 Baseline 2.181937540101.4Reachable 2.191143708101.64600 Baseline 2.13 172 5699101.4Reachable 2.131071632101.54400 Baseline 2.081294324101.5Reachable 2.07 851421101.64200 Baseline 2.041233375101.6Reachable 2.03 62 673101.74000 Baseline2.011052745101.7Reachable2.01661401101.83900 Baseline 2.001053637101.8Reachable 2.00794356101.83800 Baseline 2.001245252101.8Reachable 1.9910510217102.0 3700 Baseline1.981595240101.8Reachable1.9817613078 91.83600 Baseline1.95230 6182101.7Reachable1.95239 14808101.7
PAGE 98
83The first runs made with Reachable Set Control were notasgoodasthe results presented. Reachable Set Control was only slightly (10to20feet) superiortothe baseline guidance law and was well below expectations. Yet, the performanceofthe filter with respect to position error was reasonable, manyofthe individual runs had miss distances near20feet,andmostofthe errors were in the Z axis. Analyzing several trajectories from various initial conditions ledtotwo main conclusions. First, the initial and terminal seeker errors were quite large, especially comparedtothe constant 5 mrad tracking accuracy assumed by many studies [2,3,18]. Second, the non-linear coupled natureofthe preferred axis missile, combined with range dependent seeker errors, and the system (target) model, makes the terminal performance a strong functionofthe particular sequenceofseeker errors. For example, Figure 7.5 compares the actual and estimated Z axis velocity (Target Missile) from a single 4000 foot run. The very first elevation measurement generated a14foot Z axis position error. A reasonable number considering the range. The Z axis velocity error, however, was quite large, 409 feet per second, and never completely eliminated by the filter. Recalling that the target velocityis969 feet per second,isapproximately co-altitude with the missile and maneuvers primarily in the XY plane, this errorissignificant when compared to the actual Z axis velocity(2feet per second). Also, thisisthe axis that defines the rollangle error and, consequently, roll rateofthe missile. Errorsofthis magnitude cause the primary maneuver planeofthe missile to roll away from the target limiting (via the constraints) the abilityofthe missile to maneuver. Further investigation confirmed that the filter was working properly. Although the time varying noise prevents a direct comparison for an entire trajectory, these large velocity errors are consistent with the covariance ratios
PAGE 99
84 in [41]. The filter model was developed to track maneuvering targets. Thepen-alty for tracking maneuvering targetsisthe inability to precisely define allofthe trajectory parameters (ie. velocity). More accurate (certain) models track better,butrisk losing track (diverging) when the target maneuvers unexpectedly. The problem with the control then, was the excessive deviations in the velocity. To verify this, the simulation was modifiedtouse estimatesofposition,buttouseactual velocities. Figure 7.6 and Table7.3has these results.Asseen from the table, the control performanceisquite good considering the noise statistics and autopilot.ZAxis Velocity(feet/sec)600400200 Actual, 0-200 -400 -6000 0.5 11 .5 2TimeFigure7.5MeasuredvsActual Z Axis Velocity
PAGE 100
85 Table 7.5 Stochastic ControlLawPerformance Using Actual Velocities Initial Control Time Miss Distance RMSErrorRange Mean VarianceEKFSeeker (feet) (sec) (feet) (feet) (deg) 5500 Baseline 2.35 34 318101.5 Reachable 2.3738504II1.5 5000 Baseline 2.21 50 506101.7Reachable 2.2338623101.6 4800 Baseline 2.1755367101.6 Reachable 2.19 48 526101.54600 Baseline 2.1261366101.6 Reachable 2.1452326101.6 4400 Baseline 2.06 62 333101.7 Reachable 2.1045415101.7 4200 Baseline 2.02 60 277101.7 Reachable 2.06 44 463101.84000 Baseline 1.98 62 330101.9 Reachable 2.0253lOll9 1.9 3900 Baseline 1.9855436101.9 Reachable 2.00 64 2052101.9 3800 Baseline 1.98 53 400 9 2.0 Reachable 1.99912427101.8 3700 Baseline 1.99 63 235102.0 Reachable 1.991403110101.7 3600 Baseline 1.98138354101.8 Reachable 1.96 213 4700101.6
PAGE 101
86MissDistance(feet)25020015010050 ,. ReachableSetControlBaseline GUidance/ ................................................... -------_...------9500 40004500InitialRange5000 5500Figure 7.6 Performance Using Position Estimates and Actual Velocities While the long term solution to the problemisa better target model that will accommodate both tracking and control requirements, the same model was used in order to provide a better comparison with previous research. For the same reason, the Kalman filter was not tuned to function better with the higher gain reachable set controller. However, target velocity changes used for the generationofthe roll angle error were limited to the equivalentofa 20 degree per second target turn rate. This limited the performance on a single run, but precluded the 300 foot miss that followed a 20 foot hit.
PAGE 102
87Conclusions Reachable Set ControlAsseen from the Tables7.1and 7.2, Reachable Set Controlwasinherently more accurate than the baseline linear law, especiallyin the more realistic case where noiseisadded and sampled data measurements are used. In addition, Reachable set control did insure an initially constrained trajectory for controllable trajectories, and required minimal accelerations duringthe terminal phaseofthe intercept. Unlike previous constrained control schemes, Reachable Set Control was better able to accommodate tinmodeled non-linearities and provide adequate performance with a suboptimal sampled data controller. While demonstrated with a Preferred Axis Missile, Reachable Set Controlisa general technique that could be used on most trajectory control problems. Singer Model Unmodified, the Singer model provides an excellent basis for a maneuver ing target tracker, but itisnot a good model to use for linear control. Neither control law penalized velocity errors. In fact, the baseline control law did not define a velocity error. Yet the requirementsoflinearity, and the integration from velocity to position, require better velocities estimates than are provided by this model (for this quality seeker).
PAGE 103
APPENDIXASIMULAnONRESULTS
PAGE 104
YPosition(feet)40003000 20001000o0 13 footmiss 100020003000XPosition(feet)FigureA.lXYMissile&Target Positions Reachable Set Control4000YPosition(feet)40003000 200054 feet 1000Behind Targeto0100020003000XPosition(feet)Figure A.2XYMissile&Target Positions Baseline Control Law894000
PAGE 105
90
PAGE 106
91Acceleration(g)10050 -o-50 f-t >1009CommandedAcceleration-100o0.51Time1 .52FigureA.SMissile Acceleration Reachable Set ControlAcceleration(g)10050o-50 I>1009CommandedAcceleration-100o0.51Time1 .52Figure A.6 Missile Acceleration Baseline Control Law
PAGE 107
92Roll Rate (deg/sec)6004002000-200-400 -60000.51Time...1.52Figure A.7 Missile Roll Commands&Rate Reachable Set ControlActualRoIIRate(deg/sec)600400200o-200 -400Commanded '.:-600o0.51Time1 .52Figure A.8 Missile Roll Commands&Rate Baseline Control Law
PAGE 108
93Roll AngleError(deg)50o-50o0.51Time1 .52Figure A.9 Missile Roll Angle Error
PAGE 109
APPENDIX B SAMPLED-DA TA CONVERSION System Model The system model for the formulationofthe sampled-data control law consistsofthe plant dynamics with time correlated and white disturbances: [ ]=[F(t) ][X(t)]+[GO(t)] u(t)+T(t) 0 N(t) T(t) the reference: r(t)=A(t)r(t)+B(t)a(t) with the tracking error:[ L0][WS(t) ](1)OMWt(t)(2)e(t)=[yx(t) -Yr(t)]=[H(t)I0I-C(t) ] )(3)r(t)andquadratic cost Js(t)=E{Jd(t)II(t)}, where(4)E{ws(t)ws(t)T}=Ws(t)isthe strengthofthe system (white noise) disturbances to be rejected. E{wt(t)Wt(t)T}=Wt(t) is an input to a stationary first order Gauss-Markov process that models target acceleration (see CH VI). The following assumes a constant cycle time that defines the sampling interval and a possible delay between sampling and controlof =ti' ti94
PAGE 110
9SThecomponentsofthe controlled system arex(t)=x(t)y(t)Z(t)0(t)x(t)T(t)= [::::: ]Tz(t) r(t)(5)o0u(t)y(t) z(t)0(t)Nx(t)Ny(t)Nz(t)P(t)e(t)x(t)y(t) z(t)0(t)ry(t) rz(t)r0(t)rx(t) ry(t) rz(t)r0(t)o0o0In block form, withappropriatedimensions, the system matrices are F(t) = A(t) =F=[0I]OwG(t)=B(t)=G= (6)H(t)=C(t)=[Ihw ]i=j=8 otherwise{+wi=8,j=4(Iw)ij=I otherwise L(t)=I(hw)jj={ M(t)=Ii=j=8 otherwise
PAGE 111
96Sampled-dataEquationsSystemFromthecontinuoussystem,thediscrete-timesampled-datasystemmodelcanbesummarizedinthefollowingsetofequations:y(ti')=H x(ti)+D u(ti)ForatimeinvariantFwithaconstantsampleinterval:
(t .t)-eF(t..-1.)eF(.lt) with .lt -ttxHI'1-HI1-HI-1I000 .lt 00 0 0I00 0 .ltO 0 00I0 0 0 .ltO 0 00I0 0 0 x 48x(.lt) =0 000I000 0 0000I00 0 000 00I0 0 000 000 x 88 with x 48(.lt) = (l!w(.lt)})x 88(.lt) =exp{ -w>(.lt)} (7) (8)
PAGE 112
97Also, since G and H are time invariantandthe u(ti) are piecewise continuous, theinputandoutputmatrices (allowing fornon-coincidentsamplingandcontrol) arefi+lG= cP(ti+l,r)G(r)dr (9)t.1H=H cP(ti',ti) (10) r D=H (11)t.1Therefore, At 2j2 00 0 0 At2j2 0 0 00 At 2j2 0 0 00 At-cPx4S(At) G= At 0 0 0 (12) 0 At 0 0 0 0 At 0 0 0 0 l-cPxSS(At) I00 0 At' 000 0I00 0 At' 00H=00I00 0 At' 0(13)000I0 0o cPX4S(At') 00 000 000 0 0 0 00 000 0 0 0 00 000 00 0 00 0o cPXSS(At')At,2j2 00 0 0 At,2 j 2 0 0 00 At,2j2 0 0 00 At' -cPX4S(At') D= At' 00 0 (14) 0 At' 0 0 0 0 At' 0 0 0 0 l-cPxSS(At')
PAGE 113
98Target DisturbanceFortime correlated disturbances, Wt(t)=Wt=2 at2 ITc where at is the RMS valueofthe noise process T(,.), and T c is the correlation time.N(t)=-(1/Tc)[I]The sampled data disturbanceis(15) (16)Wt(ti)isa sequenceofzero mean mutually uncorrelated random variables, and is givenby:with =T c(l the sampled data impact on the system.(17)
PAGE 114
100 C=C
PAGE 115
APPENDIX C SAMPLED DATA COST FUNCTIONS Assume that the performance objectiveisto minimize an appropriate continuous-time quadratic cost.(1)where J sisthe stochastic cost, Jd a deterministic cost, and I(t)isthe information set available at time t. Let Jd(t)=erT Prer +rr(e(t)TQ(t)e(')+u(t)TR(t)u(tdT to (2)Dividing the intervalofinterest into N+l control intervals for discrete time control, Jd(t)=e(tn+l)TPfe(tn+l)N Jti+1 + [t{e(t)TQ(t)e(t) + u(t)TR(t)u(t)}dr]1whereforall tE [tj,ti+l)' u(t)=u(ti). Substituting the following for e(t):(3)e(t)= + t.1+Jt ti101(4)
PAGE 116
102thedeterministiccost becomes: (5)+N E [i=OJti+1 [ [4>( t,ti)e(ti)tI+{Jt 4>(t,r)G(r)dr}u(ti) t.1+Jt {4>(t,r)L(r)dr}w(ti)]T t.1+{It 4>(t,r)G(r)dr}u(ti) t.I+{It 4>(t,r)L(r)dr}w(ti)] t.1Makingthefollowingsubstitutions:(6){It 4>(t,r)G(r)dr}T t.1Q(t) {Jr 4>(r,ti)G(r)dr}]dt .t1(7)J'i+l r'l;(t,T)L(T)dT} TQ(t) Www(ti)=[{t.t.I 1.{r 'l;(t,T)L(T)dT}]dt t.1 f+l4>(t,ti) T 1: Wxu(ti)=[Q(t){ 4>(t,r)G(r)dr}]dt t.1 1(8) (9)
PAGE 117
103fi+1J: Wxw(ti)=[ t.1iWWX(ti)=[f+'( f T t.t.1 1WUW(ti)=[fi+1( f T Q(t)t.t.1 1( f t.1WWU(ti)=[fi+'( f T Q(t)t.t.1 1( f t.1And,allowingforthefactthatforall tE[ti,ti+l):u(t)=u(ti) e(t)=e(ti) w(t)=w(ti)thedeterministiccostcanbe expressed as:Jd=e(tN+ 1)1r Pfe(tN+l)+N E (e(ti) 1r Wxx(ti)e(ti) + u(ti) 1r Wuu(ti)u(ti)i=O+ W(ti)1rWWW(ti)W(ti) + 2e(ti)1rWXU(ti)U(ti) + 2e(ti)1rWXW(ti)W(ti) +2u(ti)1rWUW(ti)W(ti)}(10) (11) (12) (13) (14) (15) (16)
PAGE 118
104RecallthatJs(t)=E{Jd(t)II(t)}andthatw(.,.) is zeromeananduncorrelatedwitheithereoru. Sinceiftwo variables x&yareuncorrelatedthenE{x,y}=E{x}E{y}.Therefore,E{2e(ti)T Wxw (ti)W(ti)}=2E{e(ti)T w (ti)}E{w(ti)}=0 E{2u(ti)TWxw(ti)W(ti)}=2E{u(ti)T w (ti)}E{w(ti)}=0and(17)Jt t.1(18)Jti+1=tr[Q(t){t.1Jt s(r)L(r) T Tdr}dt)t.1(19)Consequently,forconsiderationinJ sJd=e(tN+l)TPf e(tN+l)N+ E (e(ti)TWxx(ti)e(ti) + U(ti)TWuu(ti)U(ti) +Jw(ti)1=0(20)Wxu(ti) Wuu(ti) (21)
PAGE 119
105Forsmall sample times, the weighting functionscanbeapproximatedby:WxxQ(t).M Wxu(I/2)Q(t)G(t)(.6.t)2 Wuu-[R(t)+( I13)G( t) T Q(t)G(t)(.6.t)2](.6.t) (22)(23)(24)Inorderto relate the valuesofthe sampleddataweighting terms on the systemerrorto the state variables (using discrete variable notation), let: P(ti)=[H 0 -C]Tpr[ H0-C](25)Q(ti)=[H 0-C]T Wxx(ti)[ H0-C](26)R(ti)=Wuu (27) S(ti)=[H 0-C]T Wxu (28)Therefore,withthe augmented state variableX=[xTr ]T:ExpandingQandST](29)[ -CTWXX(ti)Hoo o ] CTWXX(ti)C(30)
PAGE 120
106With non-coincident sampling and control, the penalty terms are modifiedasfollows: e(ti')=[H0-C] +rti' This modification adds additional terms:[DE][U]ati'(31 ) (32){[HOC] +[DE][U]}T.r ti' a ti'C] [DE][U]}r ti' a ti' AgainwithWxx(ti)=Wxx(ti') and Wxu(ti')=Wxu(ti), the additional terms can be grouped withRandS to generate:R(ti')=R(ti)+DTWxx(ti)D+DTWxu(ti) S(ti')=S(ti)+DTWxx(ti)[ H0-C](33) (34)
PAGE 121
APPENDIXDLQGCONTROLLERDECOMPOSITIONThecombinationofthesystem, targetdisturbance,andthereference, result in a19state controller.However,thedecoupledstructure,symmetry,andthe zerosinthe controlinputandcost matricescanbeexploitedto streamline the calculations. In general, theoptimalsolution to theLQGtrackercanbe expressed as: whereand G*(ti) =[R(ti) +GT(ti)P(ti+l)G(ti)rl [GT(ti)P(ti+l)cI>(ti+l,ti) + ST(ti)] P(ti)=Q(ti) + cI>T(ti+}.ti)P(ti+l)cI>(ti+l,ti) [GT(ti)P(ti+l)cI>(ti+}.ti) + ST(ti)]TG*(ti)(I)(2)(3)Toreduce thenumberofcalculations,partitionthe gainandRiccatiequationsuchthat: Evaluating terms:(4)GT (ti)P( ti+ 1 )G(til=[ =[=[GT10I0][PIIPl2P13 ][G]P21P22 P23 0P31P32 P33 0GTPIIIGTPI2IGTP13J [g] GTPIIG]107(5)
PAGE 122
108Therefore,Now consider, GT (ti)P(ti+ 1 I ,ti) 0=[GT1010][PIIP12P13j 00P21P22 P230 0P31P32 P3300 0=[GTPIIIGTPl21GTP13 ] 00 0 0 00 (6) (7)[ I + I oFromAppendix C, S=[H0-C]TWxu Substituting, the required terms for the gain computation become:,ti) + ST(ti))= + WxuTHI + o(8)Consequently, with(9)the optimal control can be expressedas:G1(ti)=GI + WxuTH] G2(tj)=GI + 0G3(ti)=GI WxuTC](10)(11)(12)
PAGE 123
109Partitioningequation3:[PHPI2Pl3] [ 0-HTWXX('i)C](13) P21 P22 P23=0 (ti) P31 P32 P33(ti)-CTWXX(ti)H 0 4>n 0 T 4>n 0 4>x [ PI IP12Pl31 4>x 0 0 0 0 + 0 4>T 0 P21 P22P23 0 4>T 0 0 0 4>r P31 P32P33 (ti+ 1) 0 0 4>r4>n 0 -([ GTI0101[PIIPI2Pl3] 4>x 0 0P2IP22 P23 0 4>T 0P3IP32 P33ti+I)0 0 4>r +[WxuTHI01-WxuTC]}T [GIlG21G3]=[ -CTWxXCti)Ho oo ] CTWXX(ti)C (14)+ {4>xTPll}{[4>n TOJPll+ 4>xTP2t>{4>rTP3t>{4>x TPI2} {4>x TPI3} {[4>n T OJPI2+4>TTP22}{[4>n T 0]P13+4> T T P23 }{4>rTP32}{4>rTP33}[;X4>n )('i+l)0 4>T 0 GTPI2I GTPl31 [;x4>nil -{[GTPIII0 4>T 0 + [WxuTHI0I-WxuTCnT [GIIG21G3]
PAGE 124
= -cT Wxx(ti)H 110 g oCTWXX(ti)C (15) o + o o TH}T [GIIG21G3]o (ti+l)ForthepropagationoftheRiccatiequationonly these termsarerequiredtogeneratethecontrol gain:Pll(ti)=HTWxx(tj)H + TH}T][GIlPI2(ti)= + o-[[G2] P13(ti)=-HTWxx(ti)C + -(16)(17)(18)
PAGE 125
APPENDIX E CONTROLLER ANDFILTERPARAMETERS Controller Control Delay=.21seconds Target Maneuver Correlation time=.21seconds Riccati initialization (Pf) Linear controller::Pf=IE+2 for sample times<.01secondPf=lEIfor sampletimes>.01second Roll rate controller:Pf=1.0Quadratic Cost Terms (Continuous) Linear Accelerations Q=320.R=l.Roll Control (Angle) (Rate)QIlQ22RHI GAIN IE+lIE-4lE-3LOGAINlE-4lE-3Sample Time=integration (not sample) stepforthe continuous-discrete filter System Disturbance Input=(AT AM) (Tf-TO)2/ 2.III
PAGE 126
112Filter Target Correlation Time = 2.0 seconds Riccati initialization (Pf)Positions:PII= 2500 Velocities: P44 = 2.0E6 Covariance:Pl4= 5.0E4 Maneuver Excitation Matrix (2a2) = 5120000 Seeker Measurement Noise Azimuth&Elevation R= (SITH*SITH/(RNG HAT*RNGHA T)+SOTH*SOTH +SITHI*SITHI*RNG HAT**4)/MEAS ST Range R= (SOR*SOR+SIR*SIR*RNG HAT**4)/MEAS ST Range Rate R= (SODR*SODR+SIDR*SIDR*(RNG HAT**4) )/MEAS ST With SITH = SIPH =1.5SOTH = SOPH = .225E-4 SITHI= SIPHI= 0.0 SOR = SODR = 3.0 SIR =IE-8SIDR = .2E-IO
PAGE 127
LISTOFREFERENCES [1] A. Arrow, "StatusandConcerns for Bank-To-TurnControlofTactical Missiles," GACIACPR-85-01,"Proceedingsofthe Workshop onBank-To-TurnControlled Terminal Homing Missiles," Vol1,Joint ServiceGuidanceandControl Committee, January 1985. [2] J.R. McClendon and P.L. Verges, "ApplicationsofModern Control and Estimation Theory to the GuidanceandControlofTacticalAir-to-AirMissiles," Technical ReportRG-81-20,"Research onFurureArmyModular Missile," USArmyMissile Command, Redstone Arsenal, Alabama, March 1981. [3] N.K. Gupta, J.W. Fuller,andT.L. Riggs, "Modern ControlTheoryMethodsforAdvanced Missile Guidance," Technical ReportRG-81-20,"Research onFurureArmy Modular Missile," USArmyMissile Command, Redstone Arsenal, Alabama, March 1981. [4] N.B. Nedeljkovic, "New Algorithms for Unconstrained Nonlinear Optimal Control Problems," IEEE Transactions on Automatic Control, Vol.AC-26,No.4,pp868-884, August 1981. [5] W.T. Baumann and W.J.Rugh,"Feedback ControlofNonlinear Systems byExtendedLinearization," IEEE Transactions on Automatic Control, Vol.AC-31,No.1,pp. 40-46, January 1986. [6] E.D. Sontag, "Controllability and Linearized Regulation,"DepartmentofMathematics, Rutgers University, New Brunswick, NJ (unpublished),14February 1097.[7]A.E. Bryson and Y.C. Ho, Applied Optimal Control, Blaisdell Publishing Company, Waltham, Massachusetts, 1969. [8] Y.-S. Lim, "LinearizationandOptimizationofStochastic Systems with Bounded Control," IEEE Transactions on Automatic Control, Vol.AC-15,No.1,pp49-52, February 1970. [9] P.-O.GutmanandP.Hagander,"ANew DesignofConstrained ControllersforLinearSystems," IEEE Transactions on Automatic Control, Vol.AC-30,No. 1,pp22-33, January 1985. [10] R.L. Kousut, "Suboptimal ControlofLinearTime-InvariantSystems Subject to Control Structure Constraints," IEEE TransactionsonAutomatic Control, Vol.AC-15,No.5,pp 557-562, October 1970.113
PAGE 128
114[11] D.J. Caughlin, "Bank-To-TurnControl," Master's Thesis, UniversityofFlorida, 1983. [12] J.F. Frankena and R. Sivan,"Anon-linear optimal control law for linear systems," INT. J. CONTROL, Vol 30, NoI,pp159-178, 1979. [13] P.S. Maybeck, Stochastic Models. Estimation. and Control, Volume 3, Academic Press, New York, 1982. [14] S.A. Murtaugh and H.E. Criel, "FundamentalsofProportional Navigation," IEEESpectrum, pp.75-85, December 1966. [15] L.A. Stockum, and I.C. Weimer, "Optimal and Suboptimal Guidance for a Short Range Homing Missile," IEEE Trans. on Aerospace and Electronic Systems, Vol. AES-12,No.3,pp 355-361, May 1976. [16]B.Stridhar, and N.K. Gupta, "Missile Guidance Laws Based on Singular Perturbation Methodology," AIAA JournalofGuidance and Control, Vol 3,No.2,1980. [17] R.K. Aggarwal and C.R. Moore, "Near-Optimal Guidance LawforaBank-To-TurnMissile," Proceedings 1984 American Control Conference, Volume 3, pp. 1408-1415, June 1984. [18] P.H. Fiske, "Advanced Digital Guidance and Control Concepts forAir-To-AirTactical Missiles,"AFATL-TR-77-130,Air Force Armament Laboratory, United States Air Force, Eglin Air Force Base, Florida, January 1980. [19] USAF Test Pilot School, "Stability and Control Flight Test Theory,"AFFTC-77-I,revised February 1977. [20] L.C. Kramer andM.Athans, "On the ApplicationofDeterministic Optimization MethodstoStochastic Control Problems," IEEE Transactions on Automatic Control, Vol. AC-19,No.I,pp22-30, February 1974. [21]Y.Bar-Shalom and E. Tse, "DualEffect,Certainty Equivalence, and Separation in Stochastic Control," IEEE Transactions on Automatic Control, Vol. AC-19,No.5,pp 494-500, October 1974. [22] H. Van DE Water and J.C. Willems, "The Certainty Equivalence Property in Stochastic Control Theory," IEEE Transactions on Automatic Control, Vol. AC-26,No.5,pp1080-1086, October 1981. [23] "Bank-To-TurnConfiguration Aerodynamic Analysis Report" Rockwell International Report No. C77-142Ij034C, date unknown.
PAGE 129
115[24] D.E Williams andB.Friedland, "DesignofAn AutopilotforBank-To-TurnHoming Missile Using Modern Control and Estimation Theory," ProcFifthMeetingofthe Coordinating Group on Modern Control Theory, 15-27 October 1983, (Picatinny) Dover, New Jersey, October 1983, pp. 397-419. [25]B.Friedland,etaI., "On the "Adiabatic ApproximationforDesignofControl Laws for Linear, Time-Varying Systems,"IEEETransactions on Automatic Control, Vol.AC-32,No.1,pp. 62-63, January 1987. [26] D.W. Tufts and D.A. Shnidman, "Optimum Waveforms Subject to Both Energy and Peak-Value Constraints," Proceedingsofthe IEEE, September 1964. [27] M. Pontier and J. Szpirglas, "Linear Stochastic Control with Constraints, IEEE Transactions on Automatic Control, Vol.AC-29,No. 12, pp 1100-1103, December 1984. [28] P.-O.GutmanandS.Gutman,"ANote on the ControlofUncertain Linear Dynamical Systems with Constrained Control Input," IEEE Transactions on Automatic Control, Vol. AC-30,No.5,pp 484-486, May 1985. [29]M.W.Spong, J.S. Thorp, and J.M. Kleinwaks, "The ControlofRobot Manipulators withBounded Input," IEEE TransactionsonAutomatic Control, Vol. AC-31,No.6,pp 483-489, June 1986. [30]D.Feng and B.H. Krogh, "Acceleration-Constrained Time Optimal Control in n Dimensions," IEEE Transactions on Automatic Control, Vol. AC-31, No. 10, pp 955-958, October 1986. [31] B.R. BarmishandW.E. Schmitendorf, "New Results on ControllabilityofSystemsofthe Form x(t)=A(t)x(t)+F(t,u(t,"IEEETransactions on Automatic Control, Vol. AC-25,No.3,pp 540-547, June 1980. [32] W.-G. HwangandW.E. Schmitendorf, "Controllability Results for Systems with a Nonconvex Target," IEEE TransactionsonAutomatic Control, Vol. AC-29,No.9,pp 794-802, September 1984. [33] T. Kaliath, Linear Systems, Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1980. [34] M. Vidyasager, Nonlinear Systems Analysis, Prentice-Hall, Englewood Cliffs, New Jersey, 1978. [35] K. Zhou, and P. Khargonekar, "Stability Robustness Bounds for Linear State-Space Models with Structured Uncertainity," Transactions on Automatic Control, Vol.AC-32,No.7,pp 621-623, July 1987. [36] K.G. Shin and N.D. McKay, "Minimum Time ControlofRobotic Manipulators with Geometric Path Constraints," IEEE TransactionsonAutomatic Control, Vol. AC-30,No.6,pp 531-541, June 1985.
PAGE 130
116[37] A. Sabari and H. Khalil. "Stabilization and RegulationofNonlinear Singularly Perturbed Systems Composite Control," IEEE Transactions on Automatic Control, Vol.AC-30,No.8,pp. 739-747, August 1985. [38] M. Sampei and K.Furuta,"On Time Scaling for Nonlinear Systems: Application to Linearization." IEEE Transactions on Automatic Control, Vol.AC-31.No.5,pp 459-462, May 1986. [39] I.J. Ha and E.G. Gilbert."AComplete CharacterizationofDecoupling Control Laws for a General ClassofNonlinear Systems," IEEE Transactions on Automatic Control, Vol.AC-31,No.9,pp. 823-830. September 1986. [40] H. K wakernaak and R. Sivan, Linear Optimal Control Systems, Wiley-Interscience. New York, 1972. [41] R.A. Singer. "Estimating Optimal Tracking Filter Performance for Manned Maneuvering Targets," IEEE Transactions on Aerospace and Electronic Systems, Vol. AES6,No.4,pp 473-483, July 1970. [42] M.E Warren and T.E. Bullock, "DevelopmentandComparisonofOptimal Filter Techniques with Application toAir-to-AirMissiles," Electrical Engineering Department. UniversityofFlorida, Prepared for the Air Force Armament Laboratory, Eglin Air Force Base, Florida, March 1980.
PAGE 131
BIOGRAPHICAL SKETCH Donald J. Caughlin, Jr., was born in San Pedro, California on17Dec. 1946. He graduated from the United States Air Force Academy, earning aB.S.in physics, and chose pilot training insteadofan Atomic Energy Commission Fellowship. Since then he has flown over 3100 hours in over 60differentaircraft and completed one tour in Southeast Asia flying theA-ISkyraider. A Distinguished Graduateofthe United StatesAirForce Test Pilot School, Don has spent muchofhis career in research, development,andtest at both major test facilities--Eglin AFB in Florida, and Edwards AFBinCalifornia. Don Caughlinisa Lieutenant Colonel in theUnitedStatesAirForcecurrently assigned as the Assistant for SeniorOfficerManagement at HeadquartersAirForce Systems Command. In addition to theB.S.in physics,Lt.Colonel Caughlin has an M.B.A. from the UniversityofUtah, and a masters degreeinelectrical engineering from the UniversityofFlorida. Heisa memberofthe SocietyofExperimental Test Pilots and IEEE. Lt. Colonel Caughlinismarried to theformerBarbara SchultzofMontgomery, Alabama. They have two children, a daughter, Amy Marie, age eight, and Jon Andrew, age four.117