Previous Page  54 / 82 Next Page
Information
Show Menu
Previous Page 54 / 82 Next Page
Page Background

Figure 1. Standards for functional safety of silicon IP

that not all faults will lead to hazardous

events immediately. For example a

fault in a car's power steering might

lead to incorrect sudden steering

action. However, since the electronic

and mechanical designs will have

natural timing delays, faults can often

be tolerated for a specific amount

of time. In the ISO 26262 this time

is known as the fault tolerant time

interval, and depends on the potential

hazardous event and the system

design.

What’s at fault?

Failures can be systematic, such as

due to human error in specifications

and design, or due to the tools used.

One way to reduce these errors is to

have rigorous quality processes that

include a range of plans, reviews and

measured assessments. Being able

to manage and track requirements

is also important as is good planning

and qualification of the tools to be

used. ARM provides ARM Compiler

5 certified by TÜV SÜD to enable

safety-related development without

further compiler qualification.

Another class of failure is random

hardware faults; they could be

permanent faults such as a short or

broken via as illustrated by Figure

2. Alternatively they could be soft

errors caused by exposure to natural

radiation. Such faults can be detected

by counter measures designed into the

hardware and software, system-level

approaches are also important. For

example Logic Built-In-Self-Test can

be applied at startup or shutdown in

order to distinguish between soft and

permanent faults. Error logging and

reporting is also an essential part of

any functionally safe system, although

it’s important to remember that faults

can occur in the safety infrastructure

too.

Selection of counter measures is part

of the process I enjoy the most, it

relates strongly to my background

as a platform and system architect,

and often starts with a concept-level

Failure Modes and Effects Analysis

(FMEA). Available counter measures

include diverse checkers, selective

hardware and software redundancy,

as well as full lock-step replication

available for Cortex-R5 and the ‘old

chestnut’ of error correcting codes

which we use to protect the memories

of many ARM products.

Get the measure of

functional safety

Faults that build up over time without

effect are called latent faults and

ISO 26262 proposes that a system

designated ASIL D, its highest

Automotive Safety Integrity Level,

should be able to detect at least 90%

of all latent faults. As identified by

Table 2, it also proposes a target of

99% diagnostic coverage of all single

point failures and a probabilistic metric

54 l New-Tech Magazine Europe