If youâve ever tried to gets your facilityâs alarm management system under control in the past, you might have found that it was a bit like whacking moles: As soon as you get a few knocked down, up pop a few more.
Alarm management is pretty widely recognized as an issue in the process industries, with alarm-related problems costing more than $20 billion a year to U.S. industry. So more and more plants are working to bring their stale, chattering and nuisance alarms under control. Unfortunately, if you try to do that without having a plan or without taking the time needed to do it right, those moles will just keep popping up.
The Husky PG Oil Refinery in Prince George, B.C., faced such a scenario. They made inroads in trying to improve the alarm situations that operators complained about, but never really made the progress that was needed. âOperators complained that there were too many alarms, especially on upset conditions,â notes Yiqun Ying, senior staff process control engineer at the Prince George refinery. âSome alarms donât mean anything, and the operator doesnât have time to respond. They just hit the keyboard and acknowledge it, and sometimes hit repress.â
Too many alarms
This situation is all too common among process operators who are inundated with more alarms than they could possibly acknowledge, let alone respond to appropriately. Steve Elliott, Triconex product director for Invensys, tells of a time when he was observing one customerâs control room. He watched an operator on his personal laptop, his back to the control screens. As an alarm came in, he swung one arm back behind him and quickly silenced the annoying alarm, without ever shifting his focus away from his laptop.
âOne of the biggest challenges still is the human element in everything we do; the unpredictability of the human,â Elliott says. âWeâre probably putting more reliance on the human factor than we ever have done. And that operator has more information than ever to deal with.â
Alarm management has always been a problem, notes Mike Chmilewski, vice president of DCS and safety solutions for Invensys, because itâs so easy to just keep adding more alarms as more situations arise. âThe next thing you know, thereâs more information coming in that wasnât part of the original design,â he says.
Whatâs driving the alarm count? Chmilewski says itâs more and more I/O points; better instrumentation, which is providing cost-effective measures for advanced process control (APC) and other optimization methods; and additional data points âthat could be alarmed and often areâ because it takes very little effort to add an alarm.
Poorly performing alarm systems often contribute significantly to industrial incidents. BPâs Texas City refineryâwhere a hydrocarbon vapor cloud explosion killed 15 workers in March 2005âis often referenced as the poster child of poor alarm management. âWe believe better operator information would have prevented that incident,â says Eddie Habibi, founder and CEO of PAS, an alarm management focused company. âAlarms and operator displays were two critical elements of the BP Texas City incident. The operator was blind to emerging operator limits.â
Equipment reliability has come a long way over the years, Habibi notes,âbut we have overwhelmed the operator with information.â Automation has focused increasingly on process optimization to get that last molecule of gasoline or hydrocarbon out of the crude oil. âTo do so, weâve put in complex systems. Weâve put so much information in front of the operator, the operator has just become blinded. Heâs driving blind,â he says.
Control operators are increasingly managing operations by alarm, according to Ian Nimmo, president and founder of User Centered Design Services (UCDS), which designs optimized control rooms, particularly within refining and petrochemical industries. Speaking at a conference recently, Nimmo complained that his team often sees one-third of the controllers on manual control.
Operating too much by alarms, todayâs operators are too often reactive rather than proactive, Nimmo says. âIf you have reactive operators, we have an issue; I want to see proactive operators,â he adds. Todayâs operatorâoften tasked with more plants to manage, and with limited time and exposure with each unitââhas been brought up to believe that the automation system is doing its job. He sits around until an alarm goes off, and then he does his job. Thatâs not good.â
Nimmo likens the situation to a pilot flying by alarmsâflying up until reaching an upper limit and getting an alarm, and then flying down again until reaching a lower limit. âWhat would an airplane be like if we flew it by alarms?â he asks. âWouldnât be very comfortable, would it?â
Huskyâs refinery in Prince George has about 3,500 process/equipment variables in its DCS. Those processes are monitored in an alarm-driven environment, with plant operators using the alarms as the primary indication of an event requiring action or attention.
A good indication of a plant doing things correctly is zero standing alarms, Nimmo says. According to EEMUA best practices, standing alarms should be at fewer than 10. ISA says fewer than five. The average number of standing alarms for the oil and gas industry, however, is 50. Petrochemical plants are seeing double that, on average.
Documenting the problem
So, although the state of Huskyâs alarm management wasnât terrible, it wasnât ideal either. They had no site-specific alarm standard, incomplete documentation, and no system to track the changes of the alarms. The average number of alarms wasnât bad, but peak alarms were beginning to get out of hand, and the average number of standing alarms was well outside not only best practices but also industry averages.
Also, although the alarm situation at Husky had been getting better, in Yingâs observation, improvements werenât being documented. Documentation was difficult, he says, with all projects going through contractors or consultants, and alarms being designed differently each time. âSome [projects] have more alarms than others. Theyâre all over the place. Sometimes weâd make changes, but all the changes we made, we didnât document. We kind of lost track of things,â he says.
The staff tried to use a spreadsheet to track alarm changes, but that was difficult to keep updated, Ying says. Plant staff knew they needed a better systemânot only a better alarm system, but also a better way to keep track of changes in that system.
The important takeaway for any facility trying to manage its alarms: âKnow what youâre trying to accomplish before you start throwing money at it,â says Kim VanCamp, product manager, DeltaV abnormal situation prevention, at Emerson Process Management.
Although companies are approaching alarm management and learning a lot about what to do to get their systems in shape, a lot of clients are not even taking full advantage of the alarm management tools that they purchase, according to Kevin Brown, global best practice lead for Honeywell Process Solutions. âThey use a couple reports, but theyâre not incorporating the tools into their daily work practices,â he says. âTheyâre not succeeding to the level they could, or they actually backslide.â
In spring 2012, Huskyâs Prince George refinery set out to develop and implement an alarm management system based on EEMUA-191 and ISA-18.2 standards. The project involved setting up an alarm philosophy document to clearly define the site-specific alarm standard; installing alarm management tools and software; documenting the setting, cause, consequence and corrective action of alarms in a master alarm database; improving plant alarm systems; and establishing an alarm management/rationalization lifecycle.
Since 2009, the ISA-18.2 standard (ANSI/ISA-18.2-2009: Management of Alarm Systems for the Process Industries) has been instrumental in helping plants understand what they are trying to accomplish and how they are going to measure it, VanCamp says. âCustomers felt like they hadnât received good value on whatever they did around alarms. Now theyâre regrouping around the ISA standard,â he says, noting that people didnât know what âgoodâ was. âThey have a better starting point now. They have a model.â
When chemical behemoth DuPont set out to overhaul its alarm management system, the company opted to follow the ISA-18.2 standard rather than write its own internal standards, according to Nick Sands, manufacturing technology fellow at DuPont. âThatâs a big change for us,â he says. âWeâre all about trying to make those layers better but, wow, what a culture change.â
>>Â DuPontâs Alarm Management Plan: Click here for more information.
The ISA-18.2 standard provides clear definitions of the common terminology and helps to create a universal alarm management language. It also defines an alarm management lifecycle model, which establishes the recommended workflow processes. The lifecycle sets the framework for understanding the requirements for building an alarm management program.
Conversations around lifecycles involve, for example, what should and shouldnât be an alarm. âIs there an operator action associated with the alarm? No. Then there shouldnât be an alarm,â VanCamp explains. âWill there be a consequence if the operator ignores it? No. Then there shouldnât be an alarm.â
Discussions also involve how to prioritize the alarms. âPrioritization used to be based on the strongest-willed person in the room who had a vote on the matter,â VanCamp says. But considerations should actually be focused on health and safety, economic ramifications and environmental concerns, he adds. âIs there a consequence to inaction? Is there an operator action to take? If he has less than 60 seconds to respond, it shouldnât be up to the operator; it should be designed into the safety system.â
Be rational
One of the most important steps described by the ISA-18.2 standard is alarm rationalization. Although getting rid of the bad actorsâoften the first step in alarm managementâwill quiet the control room, itâs the process of reviewing and documenting alarms that will help keep it that way when something legitimately bad happens with the system.
At a plant that hasnât previously done much with alarm management, Honeywellâs Brown says he expects to be able to achieve a 75-80 percent reduction in bad actors pretty easily. This can quickly bring average alarm rates to ISA standards, which is one alarm every 10 minutes.
Then facilities want to move on to taking care of the alarm floods and they donât think they need to worry about alarm rationalization. âWhat happens with the bad actors, though, is once they get it cleaned up and down to normal rates, they stop focusing on the bad actors,â Brown says. âBut itâs a dynamic systemâinstrumentation is failing, theyâre changing the way theyâre operating, and weâre going to continuously have bad actors showing up. It needs to be incorporated into a regular work routine.â
Even for companies who undertake rationalization, Brown says, sometimes theyâll do it on their own and do it poorly. âWhen you look at the effort and cost thatâs involved with rationalization, thatâs a huge investment thatâs been lost,â he says. âYou really donât want to do it twice.â
>> Who Is On the Alarm Management Team? Click here to read who makes up modern alarm management teams.
Husky is going through the alarm rationalization process in several of its units. âWe go through the alarm rationalization, and document all the alarm settings,â Ying says. âWhen we go through the alarm rationalization, we involve operators and process engineers. We talk about whatâs the consequence if we do not action for the alarm. Whatâs the correct action?â
Documentation of everything is key, Ying points out. âOnce we go through that, all the changes are documentedâwhether we need an alarm or not, what kind of priority we assignâeverything is in the software, and it keeps updating all the time.â
VanCamp adds, âA lot of folks shy away from rationalizationâ because itâs very time-consuming looking at every single thing that could possibly happen. âEvery tag could generate a lot of alarms.â
Make no mistakeâgetting alarms in shape is hard work, and itâs time-consuming too, requiring diligence and perseverance. âRationalization is costly,â Brown agrees. âThereâs only one way to do it. You can try to reduce the amount of work, but thereâs still the effort required to analyze every alarm.â
As part of this process, Ying brings his team together once a week to go over all the alarms for the week, area by area. âYouâll find very interesting things that way,â he says. âYou can see the difference from the process engineer or the operator. It involves a lot of discussion. The process engineer learns from the operations side. And the operators learn from the process side. Itâs a very good conversation.â
Ying says he takes things slowly, making sure to discuss everything necessary. The whole process is not an easy one, he says, requiring considerable time from several people, including bringing operators in on overtime to meet. And itâs an ongoing process that must continue not only to achieve continued improvement, but also to keep the situation from slipping back.
The time needed to really get alarms under control and keep them under control is considerable, to be sure. Simply documenting everything thatâs in the database is time-consuming, Ying notes. But itâs better than doing the work on the alarms and not documenting it, because after a while the work thatâs been done will start to slip.
Alarm management has changed considerably in recent years, according to VanCamp. âIf you go back five years, in the system bid specs, the section on alarms would just be a couple sentences,â he says. âNow when we get bid specs for a big project, we will find a separate companion document thatâs about 50 pages on how they want the alarms to behave.â
Like the example at the beginning of this article about the operator who silenced the alarm without even looking, industry is rife with other such examplesâcontrol rooms in which the hornâs going off, and yet the operatorâs hands arenât doing anything; or operators sticking a penny in an alarm panel so the alarms are constantly acknowledged. An operatorâs sensitivity becomes so deadened that when a legitimate alarm does go offâto say, for example, that the parking lot is about to be flooded with acidâhe doesnât do anything about it.
Thereâs still a long way to go and new people to teach. âWe donât care so much about reducing numbers,â notes DuPontâs Sands. âWe want to have the right alarms.â
At many facilities alarms are still an issue, and still need to be brought under control. âI would like to say with all the learnings that weâve gone through, with all the things weâve seen like with BP Texas CityâŚunfortunately, people come up who havenât been active with alarms,â Honeywellâs Brown says. âThey donât see the issue. Itâs just one alarm."
About the Author
Aaron Hand
Editor-in-Chief, ProFood World

Leaders relevant to this article: