Assessment of Design Impacts on Space Shuttle Operations and Recommendations for Manned Space Vehicle Programs

Page created by Ronald Ingram

Business

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

Assessment of Design Impacts on Space Shuttle Operations and Recommendations for Manned Space Vehicle Programs

Assessment of Design Impacts on Space Shuttle Operations
and Recommendations for Manned Space Vehicle Programs
                                      Thomas Coffee
                                      Department of Aeronautics & Astronautics
                                      Massachusetts Institute of Technology

             January Operational Internship Experience
                   NASA Kennedy Space Center
               Space Shuttle Operations Directorate

Acronyms
AC ...............Alternating Current
AFB .............Air Force Base
BSTRA ........Ball Strut Retention Assembly
CAIB ...........Columbia Accident Investigation Board
CG ...............Center of Gravity
CPU .............Central Processing Unit
DC ...............Direct Current
DFRC ..........Dryden Flight Research Center
ET ................External Tank
FRCS ...........Forward Reaction Control System
GPS..............Global Positioning System
GSE .............Ground Support Equipment
HDTV..........High Density Television
HQ ...............(NASA) Headquarters
HVAC..........High Volume Air Conditioning
ISS ...............International Space Station
IVHM ..........In-Vehicle Health Monitoring
JAXA...........Japan Aerospace Exploration Agency
JOIE.............January Operational Internship Experience
JSC ..............Johnson Space Center
KSC .............Kennedy Space Center
LH2 ..............Liquid Hydrogen
LOX.............Liquid Oxygen
LRU.............Line Replaceable Unit
MLP.............Mobile Launch Platform
MSFC ..........Marshall Spaceflight Center
MSGC..........Massachusetts Space Grant Consortium
MTBF ..........Mean Time Between Failures
NASA ..........National Aeronautics & Space Administration
NSLD ..........NASA Shuttle Logistics Depot
OMM ...........Orbiter Major Modification
OMS ............Orbital Maneuvering System
OPF..............Orbiter Processing Facility
PCR .............Payload Changeout Room
QA ...............Quality Assurance
RFID............Radio Frequency Identification
RSB .............Rudder/Speed Brake
SRB .............Solid Rocket Booster
SRU .............Shop Replaceable Unit
SSC..............Stennis Space Center
SSME ..........Space Shuttle Main Engine
SSTO ...........Single Stage to Orbit
TACANS .....Tactical Air Navigation System
TPS ..............Thermal Protection System
USA.............United Space Alliance
USAF...........United States Air Force
VAB ............Vehicle Assembly Building
VDC ............Volts Direct Current

Cover Image: Space Shuttle Columbia at liftoff [Credit: NASA].

Introduction
As conceived in the early 1970s, the Space Shuttle would launch forty times per year, lowering launch
costs by an order of magnitude versus expendable launchers to ~$1000/kg to Low Earth Orbit. The promise
of a centralized, reusable launch architecture providing the nation cheap access to space captured a winning
bulk of political and public sentiment until Challenger exploded on liftoff in 1986. In the ensuing review of
the program, it became evident that both these estimates outbid reality by roughly ten times: the Shuttle has
since launched roughly four times per year, with full-accounting launch costs somewhat higher than the
expendable launchers it was meant to replace. At the same time, previous idealism about future reusable
launch vehicles has been severely shaken.

During January 2005, I visited NASA’s Kennedy Space Center to examine the operational issues leading to
the Shuttle’s failure as a low-cost launch system and develop recommendations for future reusable vehicle
programs. Over three weeks, I had the opportunity to observe Shuttle operations preceding the Return to
Flight after the Columbia accident, and to interview several engineers and managers from each division of
the NASA Shuttle Processing Directorate and the United Space Alliance. This report outlines the major
themes generated from this experience and suggests possible directions for future human spaceflight
systems.

Overview
To students of the space program, it seems surprising that the Shuttle’s capabilities were so misjudged. At
its inception, the program foreran the world’s most renowned technical organization, the recent champion
of the most spectacular engineering achievements in human history. Yet precisely this euphoric
environment contributed to the Shuttle’s misalignment with its stated objectives.

Performance Optimization Versus Reusability
Veterans of the Apollo program were accustomed to performance-driven, expendable systems only loosely
constrained by budgetary considerations. As a cost-driven, reusable system, the Shuttle required a different
engineering approach, which it did not fully receive. While the vehicle was built to comply with reusability
requirements, it was optimized primarily for flight performance (Figure 1). The high performance
sensitivity of rocket propulsion systems provided some justification for this, but lifetime operational impact
ultimately dwarfed the gains accomplished by this methodology.

Figure 1: The Space Shuttle design optimized flight performance over operations [Credit: NASA].

The Snowball Effect
Can a misplaced balance between performance and operational costs really explain an order-of-magnitude
gap between predictions and reality? A simplistic model provides some insight into how this may occur:
suppose that every unit of some resource involved in the program in turn requires k units of resources in
overhead supporting the associated operations, where k < 1. If we assume the same marginal overhead
factor for these additional resources, we require another k2 units of resources in support, and so on. Hence,
the total overhead factor is   1 + k + k 2 + L = 1−1k . The total resources required in this model show
remarkable sensitivity to k, as shown in Figure 2.

Figure 2: A simple model of overhead propagation illustrates the Snowball Effect.

In reality, of course, the factor k varies at every level of support and for every type of resource. However,
small miscalculations in predicting operational overhead can plausibly account for large multiplicative
deviations in total cost. This Snowball Effect applies to a number of different resources:

• Infrastructure. More equipment requires more supporting infrastructure, which itself
requires more supporting infrastructure.
• Personnel. More people require more facilities and people to support and coordinate them,
which require still more people.
• Processing Time. The longer a process takes, the more tasks are required to maintain health
and organization, adding more time to the process.
• Testing Completeness. The more tests performed, the more parts must be moved to perform
the tests, hence the more parts must be tested.
• Requirements. More requirements generate more interactions with other requirements, which
generate still more requirements.

From the intuition above, small reductions in the marginal overhead factor associated with each of these
resources can produce dramatic savings in total cost.

Three Targets for Operational Costs
As with any partially reusable system, the operational costs of the Space Shuttle program can be
decomposed into a product of three factors: the repairs required, the tasks required to perform these repairs,
and the resources required to perform these tasks (Figure 3). Each of the recommendations presented below
addresses one or more of these elements.

Figure 3: A product of three factors determines total operational costs.

Design Recommendations
Most of the design recommendations discussed herein were suggested by multiple individuals involved in
Shuttle operations; a select few are based purely on personal observations. Quotations come from senior
engineers at NASA or United Space Alliance except where noted. Names of individuals have been
withheld to protect privacy. Many examples discussed illustrate multiple problems; however, most are
discussed only in the context of one most relevant issue.

Trade Performance for Robustness
Cost-effective reusable systems must maintain higher margins on components than one would choose for
expendable systems. The Space Shuttle was built with low margins and a few particularly striking
vulnerabilities. Disasters associated with these vulnerabilities are generally avoided by meticulous
processing and inspection, which drives up processing time and costs. By contrast, some NASA engineers
offered Russian programs as examples of more robust systems with higher margins, which achieved
substantially lower absolute costs in both development and operations. We focus here on a few cases where
small performance hits could make a large difference in servicing requirements:

Flight Deck Windows. The windows of the cockpit encounter micrometeoroids and orbital debris at high
relative velocities, creating dents and cracks during flight. The combined constraints of aerodynamic
heating and material resilience require polishing the windows to a tolerance of 0.0006 inches. Technicians
inspect the 33 surfaces of all ten panes with a hand-held microscope, requiring a dedicated week to inspect
a single surface. In addition, 1-2 panes per flight must be replaced entirely. Relatively low-impact changes
in total window area or materials could substantially reduce the time required for processing. (Figure 4)

Figure 4: Orbiter windows require enormous attention and protection during Shuttle processing.
Left: protective covers shield the windows inside the OPF. Right: cockpit rear windows incur less
damage by virtue of the flight path and payload bay doors, but require equally extensive inspection.

Cold Plates. Many electronic components are cooled via surface-mounted water or freon honeycomb cold
plates, with face sheets not much thicker than aluminum foil. Technicians replace these components, after
servicing, in an awkward position requiring significant force while in contact with the plate. Plates are
often damaged, though verifying damage requires taking molds of potential defects and applying optical
magnification. Replacing the cold plates requires exchanging shipments with facilities in Palmdale,
California. Here, a reasonably lightweight dent-resistant coating could suffice to mitigate the problem.
(Figure 5)

Figure 5: Avionics boxes are mounted on cold plates buried in racks like these. Left: the front
middeck racks contain most Shuttle avionics, where space limitations limit access and safe numbers
of personnel. Right: technicians must operate dexterously yet forcefully inside deep, narrow cavities.

Propulsion Concept. The engineers at USA have suggested that no propulsion system based on a high-
order chemical reaction will become “operational” before revolutionary advances in propulsion technology
are realized, due to the stresses and hazards involved. The Shuttle’s propulsion system was optimized for
chemical specific impulse, creating many issues associated with the extreme demands of cryogenic fuels,
particularly liquid hydrogen (some discussed later).

Main Engines. The Space Shuttle Main Engines (SSMEs) were designed to be operated for several dozen
flights without servicing, but are currently disassembled for testing and reassembled every flight. A change
of contractor has been recently effected in order to address the engines’ robustness.

Figure 6: Space Shuttle Main Engines are disassembled for testing and reassembled every flight.
Left: the stunning entanglement of fluid and electrical lines makes inspection and testing a multi-
month process. Right: removing and replacing engines requires custom-built heavy machinery to
provide the necessary combination of force, precision, and compatible geometry.

Hydraulic Power Lines. Many Orbiter functions draw mechanical power from hydraulic fluid lines
carrying hazardous compounds that invariably spring leaks. Some have proposed replacing these hydraulic
systems with substantially more massive batteries, which also contain hazardous materials but may prove
far more benign for servicing.

Thermal Protection System. The Shuttle’s TPS is so large that it would be easy to expend all remaining
payload mass to make it more robust. However, the brittle heat tiles of the current TPS are so vulnerable to
debris damage and flexion cracking that hundreds require replacement each flight, each requiring roughly

40 work hours. Inspection alone requires hours more for each of the ~32,000 unique tiles on the Orbiter.
Additional protection, while massive compared to payload capacity, would directly impact the critical path
of processing. For instance, the X-33 Advanced Technology Demonstrator program tested metallic Inconel
tiles that proved more robust and more replaceable than Shuttle tiles without severe weight penalties.

Figure 7: The brittle TPS tiles on the Orbiter are highly vulnerable to dents and cracks. Left: one can
make out impact streaks along the flow path and cracking of varying severity on the tiles. Right:
during processing, discrepancy tags pepper the belly of the Orbiter signifying repairs required.

Automate Testing and Verification
The Shuttle was not designed with testing and verification in mind, hence a great deal of disassembly is
required merely to inspect components of the Orbiter. (Many of these processes create additional damage,
as discussed later on.) Because inspections are done manually, they are largely subjective, and therefore
require multiple people to provide cross-checking (Figure 8). The Quality Assurance (QA) division is
tasked with finding ways to perform inspections in a reliable manner: in some cases, products cannot be
readily evaluated, so work processes must be reviewed instead, a much more difficult practice that accounts
for roughly half of all oversight activities. Following the Columbia accident and revisions of inspection
procedures, QA has found itself swamped with training new hires to review inspection checkpoints, more
than doubling its staff. Automated in-vehicle health monitoring (IVHM) systems could improve the
targeting and consistency of inspections, mitigating these problems.

Figure 8: The subjective nature of inspections requires teams of people to check each other’s work.
Left: six people representing different organizations perform the same tactile stiffness test on a tile
segment. Right: a similar team of six performs a visual examination of an OMS pod element.

IVHM could also provide new sources of data on component health and performance, with multiple
applications. First, built-in monitoring could potentially allow some verification to be done in flight,
shortening the processing schedule accordingly. Second, such systems would dramatically improve
performance information available to Shuttle engineers, by monitoring components while the vehicle is
operating in its principal design environment. Because few such measurements are currently available,
particularly on the (newer, less instrumented) surviving Orbiters, the engineers “can’t get smarter,” so the
same design uncertainties remain unresolved indefinitely. The Shuttle’s total data handling capacity
currently limits instrumentation upgrades, an item discussed later on.

We mention an example in each of these categories below:

Landing Gear Deployment. The Orbiter landing gear is cycled dozens of times on the ground each flight
for verification, but only once in flight; hence the majority of wear on the system is the product of
processing. Monitoring the operation of the landing gear in flight would allow a more accurate and lower-
impact assessment of its readiness for subsequent flight. (Figure 9)

Figure 9: The Orbiter landing gear undergoes the vast majority of its cycling on the ground. Left:
hydraulic lines actuate landing gear during processing. Right: hardware and activities surrounding
the nose landing gear require dozens of deployment and retraction cycles during a typical flow.

Vehicle Loading. Computational fluid dynamics techniques were not readily available in the 1970s, and
even today are poorly applicable to hypersonic flight regimes. In-flight measurements of loading profiles
would help engineers determine the accuracy of original design margins, and find potentially significant
areas of risk or room for optimization in the structure. Current measurements of loading come from
interpolating ground-based video footage of the Shuttle in flight, a very low-precision technique.

Revise Verification Requirements
Roughly 75% of Orbiter hardware is extracted and replaced in a typical processing flow. The Orbiter
contains 83,402 line replaceable units (LRUs), of which only 3079 are consumables. 5392 of the LRUs are
shop replaceable units (SRUs), requiring servicing at an external facility, such as the NASA Shuttle
Logistics Depot (NSLD). Roughly 2400 of the LRUs are reparable (the rest are replaced outright); 411 of
these are considered “maintenance-significant items,” a special designation granted to hardware requiring
extensive refurbishment. However, a great deal of Orbiter hardware is removed principally for inspection
and testing.

Verification requirements for Orbiter components are generally specified in terms of flight time or total
time between inspections. These schedules usually derive from failure mode predictions, i.e., the
consequences of breaking a requirement, but failure probabilities are difficult to characterize for two
reasons: first, a lack of data on component failures; second, a lack of failures themselves.

The first issue may be addressed through better in-vehicle health monitoring, as discussed above. The
second issue derives from the current practice of eliminating potential failures before they occur. While this
practice is highly advantageous in the realm of catastrophic failures, it deprives the program of valuable
information about non-catastrophic failures and prevents informed tightening of the inspection process.
Some examples follow:

GPS Receivers. Like many systems on the Orbiter, the GPS system employs both secondary and tertiary
redundant backups. Even if all three operate flawlessly throughout the flight, all three are rechecked and
refurbished again before the next flight. This prevents gathering any data on mean time between failures
(MTBF), potentially a basis for revising the inspection requirement. This seems unnecessary with triple
redundancy, and begs the question: “Can we trust what we just flew?”

Airframe. The mass driver for the airframe is not structural loading, but thermal stiffness. Several
engineers believe the airframe was overdesigned in this regard, and could have been tested with lower
margins without catastrophic failure and reinforced as necessary through subsequent modifications.

Start from Standard Parts and Processes
The Space Shuttle was designed largely from the ground up, generating part specifications based on the
precise requirements derived from performance objectives. As a result, most components of the Shuttle are
highly specialized, used nowhere else. Even commercial parts with appropriate specifications undergo
NASA requalification. While this practice in principle provides high precision and low risk, even these
gains may not be universal; and the disadvantages loom large.

The uniqueness of components demands close relationships with single vendors or other providers to
manufacture each piece of hardware on the Shuttle. This binds the program to particular vendors, whose
individual turbulence can wag the dog in many directions at once. Changes in business may break the
relationship from the vendor side, changes in quality from the NASA side; in either case, the Shuttle
program must incorporate the component into its own manufacturing at the NSLD, or engage another
vendor in an expensive and time-consuming qualification process.

In the current life of the program, vendors assume tremendous risk for small opportunities in producing
Shuttle hardware. Shuttle supply provides a very low-volume business, and requires enormous overhead;
moreover, it opens vendors to the possibility of investigation if problems arise in connection with their
components. As a result, vendors are reluctant to continue producing Shuttle hardware, and even more
reluctant to start. This translates into higher costs for commercially produced hardware, and strong
resistance to Shuttle upgrades despite major advances in technology.

Unique components also create tremendous overhead for inspection and problem resolution. Unlike mass-
produced parts, failure data is virtually nonexistent for most Shuttle components; with no standards for
comparison, policy dictates that most elements fly like new, unlike even high-performance aircraft. When
unexpected failures occur, NASA must initiate prolonged, expensive studies on its own to discover the
source of the problem, since no other entity has an interest in the failure. In some cases, this divergence
from mainstream industry can actually lead to lower reliability. The following provide some examples of
problems attributable in part to the highly customized practices of Shuttle development:

GPS Receivers. The Shuttle GPS system was modified from a standard military system due to errors
introduced by the Shuttle’s high velocities in flight, though alternative workarounds were available. Not
only did the program end up having to hire military personnel to help maintain the variant system, but it
wound up showing poorer reliability than the more mainstream military receivers. The relationship was
further strained when the military was asked to maintain outdated tactical air navigation system (TACANS)
stations at high expense solely for Shuttle use.

Inkonel Bolts. These structural bolts on the Orbiter have developed a longitudinal “fold” due to changes in
a vendor manufacturing process. After 25 years of flight, they are now required to go through a
requalification process to investigate the deviance from requirements. There is no other source for the bolts.

Crawler Shoes. The Space Shuttle Transporter Crawler required replacement tread shoes last year due to
fatigue cracking discovered after decades of wear. Only one foundry in the United States was capable of
producing the new shoes, each made from a ton of specialized molybdenum alloy. (Figure 10)

Figure 10: Only one foundry in the United States could produce the Crawler’s new shoes. Left: the
Crawler’s tread shoes together support 12 million pounds, requiring unparalleled precision
machining and material strength to distribute loads evenly. Right: the new shoes await installation.

Fluid Lines. Cracks discovered in the stainless steel bellows of the Orbiters’ aft fluid lines grounded the
Shuttle fleet during a $100M investigation to determine the cause (engine-induced resonance). They are
now required to be polished to such tight tolerances that “a fly can’t stand on them.”

Ball Joints. The ball strut retention assemblies (BSTRA balls) are very hard ball joints for struts inside the
bellow of engine fuel flex lines. Long-term cracks in these balls also required a $100M investigation.

KU-Band Antenna. The specially developed KU-band antenna is a single-point failure in flight, whose
failure severely compromised one Shuttle science mission. The program considered replacing the failure-
prone antenna with a fixed antenna array, but found the development costs for an additional specialized
component prohibitive.

Can a system with the performance demands of the Shuttle actually be built from standardized,
commercially available parts? NASA and USA engineers answer “absolutely.” Reliability can be achieved
through redundancy and well-calibrated inspections based on the data regularly available for mass-
produced parts. The only challenge is designing the vehicle with available off-the-shelf components in
mind. (Engineers noted one advisable exception to this paradigm: off-the-shelf software, which often
requires extensive modification to meet the needs of a specialized program.)

Design for Servicing
Poor design can make processing activities far more difficult than necessary, stretching schedules and
creating damage leading to further processing requirements. By some estimates, over 70% of Shuttle
processing work is unplanned, about 10% of this attributed to damage and accidents during the processing
activities. As one engineer put it, “we’re our own worst enemy.”

Increase Accessibility
Several major sources of damage are related to restricted access for technicians operating the Orbiter. Parts
optimized for flight loading often cannot withstand minor offences unwittingly committed by servicing
personnel in an effort to get where they need to go. Improved accessibility is the most consistent request
heard from technicians and engineers alike. A few examples stand out:

Structural Support Tubes. The boron-aluminum tubes providing major structural support to the airframe
are highly optimized for tension-compression loading, and thin enough to buckle at the slightest lateral
pressure. They must be inspected and often replaced if they are bumped during processing. (Figure 11)

Figure 11: Boron-aluminum support tubes buckle when bumped during processing. Left: technicians
crawl inside the Orbiter midbody through panels like the one at lower right. Right: they can easily
bump against structural support tubes like the green one at lower left.

Wiring. The Orbiter contains ~180 linear miles of wiring, which received low priority in component
geometry decisions. Contrary to design practice, the placement of wires is critical to servicing, where they
must ideally be accessible for repair and yet invulnerable to damage. Due to Kapton insulation embrittled
by ultraviolet radiation, impacts to wires can create short circuits vulnerable to runaway arc tracking; three
such incidents have occurred in flight, generating serious failures. Yet technicians must often stand on
wires to perform other tasks, and many components scrape wires during removal and reinstallation due to
arbitrary placement (Figure 12). Moreover, every other system impacts wiring, multiplying the electrical
servicing requirements, particularly during major modifications like the cockpit upgrade (Figure 13).
Accumulated wire repairs require careful tracking to avoid excessive risk of shorts, such as might be
introduced by multiple splices on the same wire (Figure 14). Last year saw 2832 discrepancy reports on
wiring alone, each requiring several hours of processing.

Figure 12: Awkward placement makes wires vulnerable to flight-critical damage. Left: wires in the
midbody can be impacted by personnel or the equipment they are moving. Right: middeck avionics
   boxes routinely scratch or fracture Kapton wire insulation during removal and replacement.

Figure 13: Repairs or modifications to any electrical system impacts wiring in the Orbiter. Left: the
digital cockpit upgrade required tremendous electrical modifications to support new systems. Right:
   this required modifications to vehicle inter-segment electrical connections in difficult locations.

    Figure 14: Performing and managing wire repairs creates enormous process overhead. Left:
  numerous techniques repair or counteract wire damage in different situations. Right: recording,
   tracking, and controlling these repairs require extensive operations and information systems.

Fuel Cells. The Orbiter’s fuel cells provide its primary power supply, and must be located in the midbody
to avoid excessive voltage drops to fore or aft at 28 VDC (Figure 15). Located underneath a shelf in the
belly of the Orbiter, they require specialized cantilever cranes to lift them in and out. Due to placement of
other components, each of the three fuel cells requires a different crane (see next section).

 Figure 15: The Orbiter’s three fuel cells present a major challenge for removal and replacement.
Left: the fuel cells reside beneath a shelf in the forward midbody, bottom left. Right: each requires a
             separate crane, strong and maneuverable, for extraction and reinstallation.

Secondary Connectors. Connectors require a secondary engagement mechanism to ensure binding during
flight. One type of connector-saver (Figure 16) is used on the backs of avionics boxes in the middeck, often
requiring technicians to install them inside hollow tubes using a prosthetic pole. This arrangement makes it
difficult to ensure the connector has tightened the full 0.128 inches from contact to a complete mate.

Figure 16: High-precision connector-savers present awkward challenges for removal and installation.
 Left: a type of multi-pin connector-saver used for avionics boxes. Right: an engineer illustrates how
      connectors reside inside hollow tubes, requiring fine manipulation using prosthetic poles.

Middeck Flight Seats. Accessibility can be impaired by personnel hazards as well as awkward geometry.
The seats in the crew module were recently redesigned to save weight (from 92 to 48 lbs), but their attach
fittings now protrude from the floor, creating a tripping hazard inside the midbody during removal and
replacement; tripping while carrying a chair could cause serious injury. (Figure 17)

Figure 17: The modified flight seats create tripping hazards obstructing access in the middeck. Left:
the new seats’ attach fittings protrude from the floor, unlike the earlier design. Right: during most
operations, these can be covered with floor plates, but not while moving the seats themselves.

Eliminate or Standardize Servicing Interfaces
Reduction and re-use of servicing equipment and procedures can dramatically impact the marginal
overhead factor associated with operational infrastructure and logistics. Such gains may be accomplished
by eliminating unnecessary interfaces between components and ground support, and standardizing those
that remain. Eliminating bad interfaces or interface modes can also save mistakes in processing. Examples
abound:

Excessive Connectors. Fluid and electrical lines running through the Orbiter were designed with little
regard for the prolificity of segment interconnects, each of which requires care, repair, and testing during
processing. Minimizing the number of segments in flow lines would strongly impact their expense.

Cold Plate Lifters. Not only do the cold plates for avionics boxes require awkward care during removal
and replacement, but in addition, different cold plates require different lifters to accomplish the transfer.
Standardizing plate and box interfaces would reduce equipment and hassle.

Power Supplies. The Orbiter’s two flight TACANS units come from different vendors; one runs on DC
power, the other on AC power, hence requiring connections to both busses, and thus more wiring. The
difference is entirely unnecessary, considering that the latter unit converts to DC internally.

Distinct Vehicles. Different Orbiters exhibit key differences due to both design changes and divergence
through processing. Many differences require separate sets of ground support equipment (GSE): for
instance, the distinct tile grid patterns on the different vehicles require separate mold libraries and tracking
systems; the unique forward reaction control systems maintain separate processing accessories; and many
developmental systems on Columbia required even more customized GSE.

RSB Drive Gears. The gears on the Rudder/Speed Brake (RSB) admit two possible modes of installation;
the incorrect mode significantly undercuts the designed load tolerance on the RSB. Its backwards
installation was only recently discovered after decades of flight. (Figure 18)

Figure 18: The Rudder/Speed Brake (RSB) gears were installed backwards for decades, with
potential performance impact. Left: the two rudder panels fan outward symmetrically from the tail
interface visible here to provide braking. Right: the gears operating the panels, seen from the rear.

Remove Conflicts Between Servicing Tasks
The tight integration of the Orbiter and its processing facilities makes coordination of servicing tasks a
veritable nightmare. One might expect that procedures on distinct components could be carried out largely
in parallel; however, some types of service operations impact or halt major subsets of operations, as most
components remain bound together in the Orbiter Processing Facility (OPF) during a flow. Anticipating
and avoiding these conflicts could shave precious time from the flow schedule.

Power Status. Orbiter power up/power down features most prominently of all constraints on OPF
activities. Some procedures require power to be activated to operate subsystem equipment (e.g., thruster
removal and antenna deployment); some require power to be deactivated for safety reasons (e.g., wire and
multiplexer replacement). A finer division of power supplies could allow more flexibility in parallel
processing.

Toxic Chemicals. Operations with the potential to release toxic liquids or vapors often require “area
clears” of the OPF during which other work grinds to a halt. The most prominent examples are hypergolic
fuels, used in the Orbital Maneuvering System (OMS) pods and forward reaction control systems (FRCS),
which are removed and replaced during each flow. In addition, all personnel are required to receive training
in handling various toxic chemicals, a major cost to operations. Additional equipment is required to guard
against safety hazards even when dangerous components are in stasis. Separating operations on these
components between facilities could provide one approach to this problem. (Figure 19)

Figure 19: Hazards from isolated components impact all processing activities in the OPF. Left:
Removal and replacement of OMS pods requires clearing the OPF; after reinstallation, covers and
desiccant tubes (blue) are required to monitor potential fuel leakage. Right: life support bottles
provide emergency breathing air for personnel working inside the Orbiter during an accident.

High-Energy Radiation. X rays are used for many kinds of damage inspection, and require clearing or
shielding surrounding areas to reduce risks to personnel.

Component Geometry. Elements of the Orbiter transition among multiple configurations during the
processing flow, providing or limiting access to other elements. Sometimes particular configurations
require substantial support equipment, as in the case of the enormous strongbacks used to support the
payload bay doors when open.

Develop Servicing Procedures with Design
Concurrent development of design and operations is an overarching recommendation of this paper;
however, this section is meant to illustrate the more specific impact of modifying operations procedures
after the design is largely locked down. These examples represent seemingly unavoidable changes
mandated by critical elements missing from the original concept of operations. In aircraft development, the
“system repair manual” is developed in conjunction with the vehicle; this produces far superior results
compared to post facto development. Such integrated, systematic development also improves the quality of
servicing documentation, largely due to an increased recognition of purpose and the involvement of
original designers.

Ascent Debris Management. The flurry of changes precipitated by the Columbia accident investigation
has generated enormous turbulence in operations related to Orbiter ascent debris (things falling off the
Shuttle). New imagery requirements called for opening the OPF bay doors on a cold night to take
electromagnetic baseline imagery for launch cameras; in addition, the launch imagery upgrade to HDTV
will require extensive analysis and interpretation to account for all phenomena previously unobserved with
lower-quality footage. Meanwhile, a lengthy research program has been initiated through the Southwest
Research Institute to construct specifications of allowable limits on different debris types as a function of
altitude. At the same time, foam application procedures have been modified for new External Tanks (ETs),
eliminating the bipod ramp foam that proved dangerous in earlier missions and adding new heaters for ice
prevention; this will require modifying existing ETs in storage, and changing inspection procedures. Even
more drastic measures are now being examined to limit foam shedding, including binding agents, hypelon
painting, and “shrink-wrapping,” any of which would stir up a whirlwind of performance and processing
issues. These debris management activities might have been addressed during design, but were largely
neglected at the time. (Figure 20)

Figure 20: Operational changes dealing with ascent debris have impacted many sectors of Shuttle
processing. Left: bipod ramp foam breakaway on STS-112 [Credit: CAIB]. Right: the new ET-120
incorporates modified components to limit foam shedding [Credit: NASA].

Dynatube fittings. Fluid fittings on the Orbiter called “dynatubes” are made of stainless steel polished to
very fine tolerances. Even slight damage to these fittings can cause leaks, which can be detected only by
deploying a mass spectrometer to search for gas traces in the surrounding air. Had leak monitoring been
considered in the design, a much lower-impact solution might be feasible.

Design for Disposal
Often the most neglected servicing operation is the last: discarding system components no longer needed.
Interactions with the outside environment sometimes require additional measures for handling disposal or,
conversely, introduce unanticipated constraints on the operation of the system.

Solid Rocket Fuels. The highly toxic particulates expelled from the Solid Rocket Boosters (SRBs) coat the
entire Mobile Launch Platform (MLP) and surrounding areas following a launch, and contaminate the
water of the thermal reservoir in addition to the platform flush water. None of this water can be reused or
released to natural reservoirs, but instead must be loaded into large fleets of trucks for safe disposal at
remote hazardous waste sites. Alternative designs might have provided better containment of fuel residues.

External Tank. Designers naturally view the ocean as a large, relatively uninhabited, circumplanetary hole
in the ground, perfect for discarding an ET from almost anywhere above the Earth. However, constraints of
safety and diplomacy have limited the ET ditch area to a relatively small patch of the Indian Ocean. This
restriction significantly reduces flexibility in launch trajectories, and impacts liftoff capacity and launch
windows (particularly for missions to the International Space Station: when one such launch experienced a
main engine cutoff three seconds early, nearly all available margin was consumed). Realization of these
issues might have suggested changing the reentry characteristics of the ET or carrying it to orbit.

Modernize Information Systems
Rocket technology has not advanced much since the inception of the Shuttle program, but information
technology has revolutionized nearly every technical enterprise. Because the Shuttle’s operational
requirements have not changed appreciably over the past few decades, it has experienced fewer
opportunities for iterative modernization than other, more dynamic organizations. NASA and USA
employees of all capacities suggested improvements to information systems that could improve Shuttle
processing.

Extend Software Capabilities
Modern software can automate much more information manipulation than currently realized, through
higher-level representations of data and slightly more sophisticated algorithms. The tedium of human
involvement in repetitive, mechanical tasks not only saps valuable time, but reduces morale and creativity
within the organization. Some particularly high-impact applications:

Task Scheduling. Current scheduling software can encode serial linkages between tasks (A must follow
B), but does not incorporate resource availability or other task conflicts, including availability of personnel.
Instead, engineers and technicians must recognize these constraints and manually adjust task schedules
accordingly. It is not a one-time process: frequent anomalies require readjusting the task schedule at the
beginning of every shift, based on progress reports from the previous shift. Some quotes from a USA
scheduling meeting demonstrate the routine problems with this approach. First, hastily generated written
progress reports foster ambiguity: “I can’t tell what these guys did last night … what happened.” Second,
human constraint management is error-prone: “We scheduled a job we can’t work … wouldn’t be the first
time.” Even when manually generated schedules are valid, they are unlikely to be optimal: humans cannot
effectively search large, complex option spaces without computational assistance. One-time encoding of
resource and configuration requirements for planned events, with real-time updates for unplanned ones,
would allow dynamic scheduling to be performed by suitably enabled software, substantially reducing
these persistent problems.

Drawing Management. Engineering drawings are a universal headache for Shuttle engineers, but some
simple extensions of drawing tools could ease much of the pain. Some systems in use today do not accept
incremental changes, requiring complete duplication in order to maintain the historical record; this makes
modifications more cumbersome, and discourages drawing maintenance. Systems like CATIA provide this
simple yet valuable functionality. Another requested capability is intelligent numbering, whereby
components’ numerical codes automatically reflect useful data like the project, assembly, and part number.
This simple automated utility would also save time and energy in drawing management.

Replace Communication Tools and Methods
Small organizations may accrue few benefits from deploying and maintaining more sophisticated
communication tools, but beyond a relatively low size threshold their benefits can be enormous. The
Shuttle program still relies on some strikingly anachronistic practices that have been widely replaced in the
large technology-driven companies of today. Some key investments in this area could create huge savings:

Operations Documentation. Several engineers I observed at KSC appeared to devote large fractions of
their daytime work hours to carrying paper documents between offices, floors, and buildings to obtain the
signatures required for closure on various procedures. In many cases, these signings involved negligible
discussion, so even face-to-face contact provided no additional value. One USA engineer estimated that a
typical flow involves ~50,000 separate documents, roughly half of these requiring additional NASA
signatures. The impact of this practice goes beyond absolute time spent going places and seeking people: it
lowers the overall “tempo” of the organization, and reduces the natural frequency of task completion from
minutes to hours or days. Paper-induced inertia also hurts institutional morale. NASA has recently begun
experimenting with “change management express” software to digitize processing and signing; while
certainly a step in the right direction, it is still catching up with the rest of even the aerospace world: for
example, the Boeing C-17 program used a predominantly paperless system throughout, to great effect. The
more universal and integrated these measures become, the more effective they will be. (Figure 21)

Figure 21: Paper documents abound wherever work is done. Left: the OPF floor—technicians and
engineers process piles of forms beneath the Orbiter. Right: the Shuttle Tile Facility—every part, no
matter how small, travels with its own documentation. Many other documents travel on their own.

Technical Discourse. Teleconferences have remained the primary means of technical discussion for NASA
and USA engineers since the birth of the program. Engineers joke about them as the “trailing edge of
technology,” with good reason: the serial flow of telephone conversations tends to send issues in circles,
leaving moderators little ability to organize and unify presented information. This limitation not only slows
real communication, but may affect the ultimate quality of results, since moderators tend to wrap up
decisions with the most recent discussion closer to mind. Teleconferences also make poor use of personal
attention: though individuals can plan partial attendance based on published agendas, the meetings’
inherent fluidity still leaves them sitting in on many talks irrelevant to their own issues. More focused and
flexible systems, such as shared web pages for specific topics, could provide a better means to dynamically
organize information and target engineers’ mental energy.

Streamline, Standardize, Synthesize
Staff at KSC devote a tremendous amount of time to using information systems, hence improved efficiency
in these areas can generate tremendous savings. Suggested improvements generally associate with one of
three major approaches: streamline—reduce the time required to perform a given data manipulation;
standardize—use common formats for the same data across different systems; and synthesize—combine
systems containing the same data into a single system wherever possible. These approaches could be
particularly useful in the following application areas:

Data Reporting. Many software reporting systems incorporate redundant process data, which must be
manually entered into multiple systems in different formats by engineering staff. One example is the PITA
and ELOG systems used in the electrical division, which could be at least partially combined. Separate but
redundant systems within different divisions can also create authorization barriers, and should be avoided.
Data collection systems could also be more unified throughout the program, so that all data sources
generate the appropriate formats from the outset. The Lockheed-Martin Atlas V program illustrates the
impact of standardized data systems, with an estimated ~40% cost savings over comparable programs.

Component Logistics. Tracking and inventory management for both parts and equipment could benefit
from a number of modern technologies. Universal bar code or radio frequency identification (RFID)
tagging could help with locating, moving, and storing components, as used by military programs and
companies like WalMart. Some other automation technologies have already made substantial
improvements to logistics operations (Figure 22). In addition, some past programs have allowed multiple
specifications for identical materials and commodities (e.g., isopropyl alcohol), multiplying costs of
purchasing, tracking, and storing; combining specifications would eliminate these redundancies.

Figure 22: Automation technologies have substantially streamlined parts handling in the logistics
warehouse. Left: forklifts and other vehicles follow magnetic aisle guide strips and shelf geometries.
Right: a robotic fetcher handles small parts independently via an access port at the end of the aisle.

Configuration Control. While most data handling functions require simply read-only repositories,
configuration control presents the further challenge of synchronizing dynamically updated state information
across the program. The timeliness and accuracy required of such information—both to save effort and
prevent mistakes—would make program-wide standardization of configuration control systems particularly
valuable.

Capture Institutional Knowledge
All the enhancements of information systems suggested above will help to encode more institutional
knowledge in persistent, usable forms; however, in some cases, dedicated measures may be necessary to
mitigate critical information loss in the life of a large vehicle program. Experiences from the Shuttle point
to two danger areas:

Brain Drain. Senior NASA and USA engineers recall a few key periods during which the program
experienced major turnover in technical staff. The most dramatic such “brain drain” now appears imminent:
the people who began the Shuttle program as young graduates are now reaching retirement. With them will
transpire a great deal of perspective accumulated from working with the program since inception. Though
difficult to measure, most believe the effects will prove significant. More clear and comprehensive records
describing design history and decision logic could help compensate for the transience of individuals.

Engineering Drawings. The original Shuttle drawings have not generally been updated to reflect design
modifications. Because the contractors own the drawings, NASA has no leverage to control their
maintenance. Following the Columbia accident, the program planned a system-wide overhaul of drawings
in accordance with CAIB recommendations; however, this plan was dropped for lack of the estimated $12
million in required funding. The lack of current drawings hampers engineering analysis and processing
activities to this day, an obvious target for change in future programs.

Replace Some Key Technologies
The Shuttle was developed to take advantage of many cutting-edge technologies and hardware of its time in
a wealth of engineering disciplines. Several of these techniques and components have proven problematic
in more or less unanticipated ways, and should be reconsidered for future applications, independent of more
systemic issues of design methodology. Some examples:

Galvanic Couples. Many pairs of metals placed in contact with each other will form galvanic couples,
which gradually lead to chemical breakdown. The Orbiter exhibits ubiquitous galvanic coupling between
metal parts, leading to extensive corrosion requiring constant inspection and maintenance. Such cases
should be largely avoidable with careful design.

Kapton Insulation. One of the largest issues in Shuttle processing, breakdown of Kapton wiring insulation
(due partly to ultraviolet radiation and atomic oxygen exposure) leaves it vulnerable to arc tracking,
wherein the insulation carbonizes to a conductive state. Carbonized Kapton has a resistance of ~10 ohms,
which on a ~30-volt power supply leaves it just below the current limit on the Orbiter’s circuit breakers,
allowing short circuits to persist, particularly between feed and return lines that often run together. The
insulation becomes brittle over time, after which impact or flexing can easily cause shorts, leading to
further degradation and arc tracking. Arc tracking is responsible for failures on at least three flights,
including two Main Engine Controllers on STS-93, which could potentially have led to a double main
engine shutdown and subsequent loss of vehicle; it was investigated as a possible cause of the Columbia
accident. Though Kapton combines many desirable properties, military systems no longer use it for
electrical insulation; other compounds such as Teflon hybrids may prove more suitable for future vehicles,
though replacing the ~150 miles of Kapton insulation in each Orbiter is no longer feasible. (Figure 23)

Figure 23: Kapton wiring insulation creates major risks of short circuits in Orbiter electrical
systems. Left: arc tracking damage can lead to shorts in surrounding wire bundles. Right: bundles of
Kapton wiring pervade the Orbiter, often in remote locations like this wing interior. [Credit: CAIB]

Data Lines. Some of the wiring burden derives from the use of dedicated lines for data transfer throughout
the Orbiter. More modern vehicles (and other systems) use data busses to merge data handling into a few
combined electrical lines, significantly reducing overheads associated with wiring. One engineer suggested
an arrangement “like a motorcycle,” with one CPU each at prow and stern and a single universal data bus
in between.

Secondary Fasteners. Shuttle requirements mandate secondary mechanisms to verify torque and locking
on structural fasteners, an area presenting unanticipated challenges. One common method is self-locking
nut plates, which provide additional locking friction in the final tightening of the fastener. However, these
exhibit a number of problems: first, they wear down, losing their self-locking feature after 5-7 fastening

cycles; second, they must be riveted into the structure, often in blind locations; and third, their rivet holes
enlarge and lose strength with each removal and replacement during servicing. Many structures engineers
currently advocate an alternative approach known as “locktight,” a coating placed on threaded fasteners
that hardens to bind them after mating; this approach was not certified with the original design, and has
hence proven difficult to put into service.

Fluid Connectors. The Shuttle currently employs fluid connector fittings with low tolerance to damage or
misalignment. Their small pins reduce the size of connectors, but greatly increase the risks of hang-up and
separation; they are also highly susceptible to damage and corrosion. Pneumatic connectors show persistent
leaks, and some require laser polishing to meet acceptable tolerances. Many engineers have suggested other
types of commercially available fittings (such as KC fittings), which are more damage-resistant, easier to
engage, and more robust to leakage in misalignment.

Flex Hoses. Orbiter flex hoses generally consist of metal bellows with braided stainless steel overwrap.
When bent to a particular radius of curvature, their tension-compression cross-section flattens and the
resulting stress spikes cause cracking. They are also susceptible to fatigue from low-frequency resonances
during transport. Their braided overwrap tends to collect moisture, leading to corrosion which traps
contaminants, creating a difficult cleaning chore. All these problems could be mitigated by working with
stiff stainless steel tubing rather than flex hoses.

Solid Rockets. The impact of solid propellant hazards and their secondary ramifications has been
consistently underestimated. The Shuttle program has routinized solid rockets to unprecedented levels (a
visiting engineer from the Russian Space Agency ran from the VAB at the sight of fueled booster segments
on the floor). Infrastructure for processing, cleaning, testing, and repair requires extensive built-in safety
measures. Toxic particulates require clearing areas during certain procedures and even yet contribute to
some occupational health problems. Anomalies associated with the Solid Rocket Boosters (SRBs) tend to
create major schedule upsets. During transport, the boosters require extensive protection from sources of
electrostatic discharge, such as lightning. As mentioned earlier, their exhaust toxifies wastewater from each
Shuttle launch; moreover, their propellant grain penetrates the MLP blast shield, destroying cables inside
the launch service towers. In the final analysis, though motivated by the promise of reusability, the SRBs’
refurbishment costs have proven comparable to the costs of pure replacement. Their substantial risks have
led USA engineers to recommend against using solid propellant devices wherever possible. (Figure 24)

Figure 24: The hazards associated with toxic, explosive SRB solid propellants reach far and wide.
Left: facilities handling fueled boosters, like the VAB, must take special precautions to avoid leakage
or ignition. Right: the booster discharge destroys cables in launch service towers (one shown at left)
and contaminates the wastewater flushing the MLP at launch (pipes at center and bottom).

Thermal Protection System. The litany of issues associated with the Orbiter thermal protection system
(TPS) derives in part from a key design decision: the separation of the TPS from the airframe. The tile
system was expected to work something like a jigsaw puzzle, with tiles snapping in and out as needed; the

airframe architecture contributed to the failure of this concept in multiple ways. First, the differential
thermal expansion of the airframe and TPS necessitated high-precision gaps between TPS tiles, driving up
the costs of tile inspection and replacement. Second, thermal cycling led to tile cracking and slight
perturbations of the tile grid on every flight, a major contributor to processing requirements. By contrast,
the X-33 design made the TPS and airframe one and the same, eliminating this fundamental problem.

Figure 25: Interactions between the separate TPS and airframe cause many of the problems
afflicting the tile system. Left: the gaps required to accommodate differential thermal expansion
necessitate manually measuring super-fine tolerances on tile geometry. Right: because of grid shifts,
tiles change shape between flights, hence their molds must be iteratively modified at each repair.

Plan Programs, Not Vehicles
Much more than in other industries, design in aerospace has traditionally focused on individual products
(often vehicles) rather than long-term support for particular capabilities. This may be partly attributed to the
relatively large scale and expense of these individual products in the aerospace sector. However, this
practice can impair the cost-effective achievement of underlying programmatic goals. In particular, it
reduces the frequency of product iteration far below that of most other industries, creating numerous
technical and organizational issues. More iterative, globally focused programs could mitigate these
problems; in particular, they could:

Increase Operational Feedback to Design
As we have seen, anticipating the operational impact of design decisions can be quite challenging, despite
great effort. In the words of one NASA engineer, “as a designer, you only have so much time to figure out
how to do it all right.” As the first reusable space vehicle, very little knowledge gained from operating the
Shuttle has ever been exploited to design another such vehicle. At the same time, feasible design changes to
the Shuttle itself are small, few, and far between: merely beginning the Engineering Order process can cost
upwards of $10K. This suggests Frederick Brooks’ famous maxim: “Plan to throw one away.” There will
never be a complete substitute for significant design changes based on feedback from the field. Some
examples:

Structural Optimization. Due to the complexity of structural analysis and lack of relevant flight data, the
Orbiter’s original structure and layout suffers from many suboptimalities. The inertial configuration of the
vehicle (CG placement, etc.) proved poorly estimated, ultimately leading to the removal of useful front-
wheel steering (to save nose weight) and common use of wasted ballast for payload balancing. Meanwhile,
loading tolerances were often pessimistic: while a few components have been replaced to save mass (such
as the heavy rear “flipper doors”), most structural elements are too highly integrated into the airframe to be
practically modified. Redesign with better data and improved technology would allow substantial
corrections to these deficiencies.

You can also read