Leading the Way in Reliability Prediction Part 1

Part 1: The Reliability Prediction Standards

(This is Part 1 of a 2-part series. Read Part 2: The Relyence Advantage.)

Reliability Predictions offer a key advantage as part of your overall reliability analysis toolset. Reliability Predictions are used to estimate the predicted failure rate or MTBF of a product or system during any portion of the product lifecycle. Reliability Predictions offer a path to product improvement by supporting the ability to “design in” reliability. At the early design stage, Reliability Predictions enable you to perform an assessment of likely failure rate characteristics and then make design changes as needed for areas of weakness. Reliability Predictions can also be used in the early design phase to evaluate different design options by taking into account the reliability profiles of the various alternatives. Allowing you to perform design trade-off analysis with metric-based assessments empowers you to make the best decisions for your business.

Reliability Prediction’s historical roots are in the military and defense sector, but over the years have been adapted and broadened for use in a wide range of industries. Essentially, the advantages afforded by reliability prediction analyses make it an important part of managing and maintaining reliability and quality objectives.

Reliability Predictions offer a set of equations to model the failure rates of a variety of electromechanical components that make up a product or system. These equations were built by analyzing a huge amount of data over a long period of time and then using statistical analysis to determine the equations which best modeled the failure characteristics of the accumulated data. The variables used in the equations vary based on component type, but include data such as device ratings, temperatures, operating parameters, and environmental conditions.

There are several widely accepted Reliability Prediction standards, and Relyence supports all of them. With Relyence Reliability Prediction, you can perform reliability analyses based on the standard you prefer, including:

MIL-HDBK-217 F Notice 2 Part Stress
MIL-HDBK-217 F Notice 2 Parts Count
Telcordia Issue 4
217Plus
217Plus Parts Count
ANSI/VITA 51.1 Part Stress
ANSI/VITA 51.1 Parts Count
NSWC-11 Mechanical
China’s GJB/z 299C Part Stress
China’s GJB/z 299C Parts Count

MIL-HDBK-217

MIL-HDBK-217 is one of the most widely known Reliability Prediction standards. It was one of the first models developed, and many other reliability standards available today have their roots in MIL-HDBK-217.

MIL-HDBK-217, or Military Handbook: Reliability Prediction of Electronic Equipment, was originally developed and published for use by the Department of Defense. Over the years there have been many updates to the MIL-HDBK-217 document, which have resulted in the suffix designations in the document name: MIL-HDBK-217D and MIL-HDBK-217E Notice 1 for example. The latest release of MIL-HDBK-217 is MIL-HDBK-217F Notice 2.

There are two components of the MIL-HDBK-217 standard: the Part Stress section and the Parts Count section. The Part Stress section leads off the document and includes a number of equations that model the failure rate for a wide variety of electrical components. For example, the equation for Microcircuits, Gate/Logic Arrays and Microprocessors is:

λp = (C1 * πT + C2 * πE) * πQ * πL

where λp is the failure rate in failures/million hours (or failures/10e6 hours, or FPMH)

The factors in the equation are various operating, rated, temperature, and environmental conditions of the device in the system. For this above equation, the following list describes the variables:

C1 factors in the complexity of the device, such as the number of gates or transistors
πT factors in the ambient temperature and any temperature rise associated with the device
C2 factors in the package of the device, or how it is manufactured and placed in the system, such as surface mounted and/or hermetically sealed
πE factors in the environment that the device is operating in, such as in space, in an aircraft, in the sea, on the ground, etc.
πQ factors in the quality of the device based on how it is procured
πL factors in how long the device has been manufactured

The equations, the variables, and the data parameters needed vary for all the different components modeled. The Part Stress section of MIL-HDBK-217 includes complete details on all the equations and how to assess the variables used in the equations.

The second section, Parts Count, is useful in early design stages when the design is still in progress and not all operating parameters are known. Parts Count predictions do not require as many data parameters for analysis compared to Part Stress predictions. Parts Count analyses can be used as an estimation technique, and, in general, are not as accurate as Part Stress analyses. By using Parts Count models, you can obtain early failure rate assessments and then refine them as your product design becomes more finalized.

For example, the equation shown above for Microcircuits, Gate/Logic Arrays and Microprocessors in Parts Count is:

λp = πg * πQ

where πg is a generic failure rate based on a subset of information; in this example it is based on device technology type, environment, and device complexity.

In many cases, Parts Count is used to start a Reliability Prediction analysis. Then, as the product design becomes more firm and data parameters are established, the Parts Count prediction is moved over to Part Stress, maintaining all the data already entered during the Parts Count assessment.

Telcordia

Another widely used and accepted Reliability Prediction standard is commonly referred to as Telcordia. Early on, Telcordia was referred to as the Bellcore standard. The full name of the Telcordia standard is Telcordia: Reliability Prediction Procedure for Electronic Equipment, Special Report SR-332. The Telcordia standard has also been through several updates and revisions, which are designated by the Issue Number. Telcordia Issue 4 is the latest Telcordia Reliability Prediction standard.

Initially, the Bellcore/Telcordia standard was developed for use in the telecommunications industry. Today, Telcordia is commonly used in the commercial sector, but its use over the years has become widespread and is now used throughout a broad range of industries, even those related to military and defense applications.

Telcordia includes equations for the black-box steady state failure rates of devices, as well as equations for the upper confidence level and standard deviation of the black box steady-state failure rates. Example Telcordia equations to compute the black-box steady state failure rate of a device are:

λBB = λG * πQ * πS * πT

where λBB is the failure rate in failures per billion hours (failures/10e9 hours, or FITs)

and

σBB = σG * πQ * πS * πT

where σBB is the standard deviation of the black-box steady state failure rate

The factors used in the equations are:

λG is the device generic failure rate, which is obtained from a series of tables in the Telcordia standard and is based on device parameters which vary according to the device under analysis
σG is the standard deviation of the generic steady-state failure rate
πQ factors in the device quality level
πS factors in the device stresses, such as electrical stress
πT factors in the device temperature stress

Additionally, the πE, which factors in the environmental condition, is factored into the overall failure rate calculation.

Once the device level black-box steady state failure rates are determined, the unit level and system level failure rates can be calculated.

Using the black-box steady state failure rates as a basis, the Telcordia standard includes additional methodologies for augmenting failure assessments by taking into account other data that may be available about the devices, units, or systems under analysis. This additional information is not required, but can be used if available to adjust failure rates to reflect actual product performance. Telcordia Reliability Predictions can:

Compute the upper confidence level of steady state failure rates
Integrate laboratory data from devices, units, or systems with or without burn-in data
Integrate field data from devices, units, or systems with or without burn-in data
Determine early life factors based on no burn-in, limited burn-in, or extensive burn-in

Essentially, any real-world data available can be used to further refine the estimated failure rate values. It should be noted that any of this additional data is not required to perform a reliability prediction based on the Telcordia standard. It is up to the analyst to determine if any of this additional data is available and if it is helpful to include in the reliability prediction analysis. In some cases, Telcordia analyses are initially performed to obtain the black-box steady state failure rates, and then updated as laboratory, field, and burn-in data become available.

In summation, some of the unique features of Telcordia include:

Models for components not found in MIL-HDBK-217, such as lithium batteries, hard disk drives, AC/DC power supplies, gyroscopes, and many more.
Early life calculations to help analyze failure rates during initial product introduction, or the early life phase, when infant mortality rates are a factor.
Augmenting failure rates based on data obtained from laboratory test data. By factoring in test data information, your predictions are weighted according to the amount of test data you have.
Augmenting failure rates based on data obtained from fielded products. By adjusting your failure rates based on this real-world information, your predictions will more accurately reflect your product performance.

217Plus

The 217Plus™ reliability prediction standard was developed by Quanterion Solutions. Work on 217Plus was started under Department of Defense contracts with the Reliability Analysis Center (RAC) and Reliability Information Analysis Center (RIAC), and was released originally under the name PRISM.

The failure rate models of 217Plus have their roots in MIL-HDBK-217, but have enhancements to include the effects of operating profiles, cycling factors, and process grades on reliability.

The official 217Plus standard name is Handbook of 217Plus Reliability Prediction Models. An example equation for capacitors in 217Plus 2015 Notice 1 is:

λ_P =π_G* π_C * (λ_OB * π_DCO * π_TO * π_S + λ_EB * π_DCN * π_TE + λ_TCB * π_CR * π_DT ) + λ_SJB * π_SJDT + λ_IND

where λ_p is the failure rate in failure per million calendar hours.

For the equation above, the following list describes the variables:

π_G is the reliability growth factor
π_C is the capacitance factor
λ_OB is the operating base device failure rate
π_DCO is the operating duty cycle factor
π_TO is the operating temperature factor
π_S is the stress factor
λ_EB is the environmental base failure rate
π_DCN is the non-operating duty cycle factor
π_TE is the non-operating environment temperature factor
λ_TCB is the cycling temperature base failure rate
π_CR is the cycling rate factor
π_DT is the delta temperature factor
λ_SJB is the solder joint base failure rate
π_SJDT is the solder joint delta temperature factor
λ_IND is the induced failure rate

The equations, the variables, and the data parameters vary based on the specific device being modeled.

Once the device failure rates are evaluated, they are summed up to determine a base system failure rate. At this point, further analysis can be done at the system level if more data about the system is available, such as test or field data. By factoring in this information, the 217Plus analysis will provide a more accurate predicted failure rate estimation. At the system level, 217Plus can incorporate environmental stresses, operating profile factors, and process grades. If this data is not known, default values are used.

Additionally, there is a Part Count reliability prediction intended for use in early design when all data parameters are not yet finalized, and provides a simpler approach to prediction calculations. The Part Count section of 217Plus includes a number of tables for device failure rates that are based on the combination of the environment and operating profile of the system. In this case, a table lookup will provide the failure rates for your devices without the need for calculations.

ANSI/VITA 51.1

ANSI/VITA 51.1 is a collaborative industry standard that provides recommended modifications to the MIL-HDBK-217 F Notice 2 Reliability Prediction Handbook to reflect more updated failure rate assessments. The ANSI/VITA 51.1 rules, recommendations, and suggestions take into account changes in device technologies, improvements that have occurred over time since the MIL-HDBK-217 F Notice 2 standard was released, and updated data parameters to more accurately model current device quality and performance.

ANSI/VITA 51.1 is not an independent prediction standard, but works in conjunction with the MIL-HDBK-217 standard. The part data and models for ANSI/VITA Part Stress and Parts Count models are the same as those from MIL-HDBK-217. However, when ANSI/VITA is employed, updates to the inputs to the equation models that reflect more recent designs are automatically factored in.

NSWC Mechanical

Developed by the Naval Surface Warfare Center (NSWC), the Handbook of Reliability Prediction Procedures for Mechanical Equipment details the reliability prediction procedures and models for a variety of mechanical equipment. There are several models defined in this prediction standard for a wide array of mechanical components including gears, springs, pumps, gaskets, and many more.

Because operating conditions can vary across applications, the NSWC models take operating conditions and material properties into account for failure rate calculations. The appropriate factors vary depending on the part being modeled but can include aspects like misalignment, loading factors, viscosity, and material hardness.

An example failure rate equation for gaskets and static seals per the NSWC-11 Mechanical standard is:

λ_SE = λ_SE,B* C_P * C_Q * C_DL * C_H * C_F * C_V * C_T * C_N

where λ_SE is the failure rate of a seal in failures per million hours, and λ_SE,B is the base failure rate of a seal as denoted in the standard.

For the equation above, the C variables consider the effects of the following factors on the seal failure rate:

C_P – fluid pressure
C_Q – allowable leakage
C_DL– seal size
C_H – contact stress and seal hardness
C_F– seal smoothness
C_V– fluid viscosity
C_T– temperature
C_N– contaminants

China’s GJB/z 299

China’s GJB/z 299 is the most widely used Reliability Prediction standard in the extensive Chinese market. The full name of the standard is GJB/Z 299: Reliability Prediction Model for Electronic Equipment. Its revisions and updates are designated with suffix notations similar to MIL-HDBK-217. The most recent China GJB/z standard is China’s GJB/z 299C.

China’s GJB/z 299 Reliability Prediction standard has its roots in MIL-HDBK-217, and has been developed to align with the procedures and devices found in China.

In a similar fashion to MIL-HDBK-217, there are two components of the China’s GJB/z 299 standard: the Part Stress section and the Parts Count section. The Part Stress section includes complete details on all the equations and how to assess the variables used in the equations. Parts Count predictions do not require as many data parameters for analysis compared to Part Stress predictions, and are meant to be used in early design when not all data parameters are known. Typical usage is to start with a Parts Count analysis and then move to a Part Stress prediction as the design becomes more finalized.

China’s GJB/z 299 also includes an appendix for failure rate analysis for imported components, or those not manufactured in China. This enables the Chinese reliability prediction standard to be used across a broad range of products that include components manufactured across the globe.

An example equation from China’s GJB/z 299 for Bipolar Digital Circuits is:

λp= πQ * [C1 * πT * πV + (C2 + C3) * πE] * πL

Where:

λp is the failure rate in failures/million hours (or failures/10e6 hours, or FPMH)

πQ factors in the quality of the device based on how it is procured.
C1 and C2 factor in the complexity of the device, such as the number of gates or transistors.
πT factors in the ambient temperature and any temperature rise associated with the device itself.
πV factors in the voltage stress.
C3 factors in the package of the device, or how it is manufactured and placed in the system, such as surface mounted and/or hermetically sealed.
πE factors in the environment that the device is operating in, such as in space, in an aircraft, in the sea, on the ground, etc.
πL factors in how long the device has been in production.

Relyence Reliability Prediction

Relyence takes the very detailed process of performing a reliability prediction analysis based on one of these standards and makes it easy, accurate, and efficient.

Relyence incorporates a host of features around the core equation modeling part of predictions to make your analysis more complete and effective. Some examples include:

Built-in Parts Library to pull in device data parameters automatically
Model extensibility which enables you to combine the unique features and advantages of each standard across all standards, such as incorporating laboratory test data into MIL-HDBK-217 analyses
Dashboards which provide a high-level overview of reliability metrics
Visual system modeling to enable you to build graphical diagrams for better understanding
Product hierarchical breakdown into subsystems for efficient calculation roll-up to obtain system wide metrics
Built-in defaults for quick failure rate calculations when all data parameters are not available
User-defined defaults to replace or supplement built-in defaults
Rich and completely customizable reports
Support for What-If? analyses and design trade-off evaluation with fast, accurate calculation

Part 2 of this series dives deeper into these capabilities and details the advantages of Relyence Reliability Prediction.

Relyence Free Trial

At Relyence, our mission is to build not only the most capable tools, but also the most technologically advanced and well-crafted applications available. We rely on our expertise to build the tools reliability experts expect, and couple that with a design elegance and utility that makes our tool suite stand out.

Relyence offers a free fully functional trial of our complete tool suite – register for yours today! Feel free to contact us, or call today at 724-832-1900 to speak to us directly about your requirements or to schedule a free webinar.

Relyence Reliability Prediction: Leading the Way in Reliability Prediction Analytics Part 1

Relyence Reliability Prediction: Leading the Way in Reliability Prediction Analytics Part 1