Machine Life Expectancy – What should the maintenance organization focus on?
You can also find this article in MRO Magazine’s December Digital Edition here.
Recently I have been seeing the P to F interval curve popping up a lot on my LinkedIn feed and in articles that I have read. It is a concept that I was first introduced to when I was implementing Reliability Centred Maintenance in the Engineering and Maintenance department at the plant where I worked at the time. It is a great idea that, if done correctly, is a real benefit to maintenance. Why? Because it delivers cost savings and cost avoidance. Let me explain.
Fig 1. The P to F Interval Curve.
The P to F curve is used as a learning tool for Condition Based Maintenance. The curve represents the life expectancy of a machine, an asset. The P is the point when a change in the condition of the machine is detected. The F is when it reaches functional failure, meaning that it is no longer doing the job it was designed to do. For example, if a seal that is designed to keep fluids in and contamination out is now leaking, it’s in a state of functional failure. Will this put the machine down? Probably not, but it depends on the importance of the seal and the application. This is an important point because the P (potential failure) is a fixed point, the moment you detect the change in condition, but the F (failure) is a moving point. Not all warnings of failure put the machine down; very often you have options and time.
Consider this: If I have a bucket that has a hole in it, it is in a functional failure state. But can I still use it to bail out my sinking boat? You bet I can!
Failure comes at us in many ways, and we have many ways to combat it. If you detect the potential failure early enough (and it can be months and months before actual failure), you can avoid the breakdown. You can schedule an outage to do a repair. It’s not a breakdown, the machine hasn’t stopped, it’s not downtime. This is cost avoidance: the plant avoids the lost production and the costs that come with unplanned downtime.
There are a lot of examples of cost avoidance and also of cost savings. For instance, at the plant I worked at we used ultrasound to monitor bearings. We detected a very early warning in the sound level, greased the bearing, and the sound level dropped. We saved the bearing from damage and prevented a potential breakdown, so this is cost savings. Even if there is some bearing damage, the fact that we are aware of it and monitoring the situation lets us avoid any secondary damage.
It’s one price to replace a seal and it’s more if you have to replace a bearing in a gearbox. However, it can be very expensive to have to replace a shaft because the bearing has seized onto it and ruined it. Secondary, ancillary damage can mount up very quickly if you don’t heed the warning you are given with the P of potential failure.
This warning of potential failure gives you time before any breakdown. The earlier the detection, the more time: time to plan and to weigh your options. What people tend not to do is failure analysis while the machine is still in service. A failure analysis gives you a great start on seeking out the root cause, but start right away, not when the machine is down.
Condition monitoring, or as it’s often called, Condition Based Maintenance (CBM), does work. However, for me there is a downside to this and I will explain why shortly. CBM is based on measurement, which is good because we all know that to control a process we must measure it.
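To put a number on "the earlier the detection, the more time", here is a minimal sketch in Python. The function name and the dates are hypothetical and chosen only for illustration. It assumes you have a detection date (P) and an estimated functional failure date (F), and it simply counts the days in between.

```python
from datetime import date


def pf_interval_days(detected_on, estimated_failure_on):
    """Days between detection (P) and estimated functional failure (F)."""
    if estimated_failure_on < detected_on:
        raise ValueError("estimated failure date precedes detection date")
    return (estimated_failure_on - detected_on).days


# Hypothetical example: the same estimated failure date,
# detected early in one case and late in the other.
estimated_failure = date(2024, 9, 1)
early = pf_interval_days(date(2024, 3, 1), estimated_failure)
late = pf_interval_days(date(2024, 8, 18), estimated_failure)

print(f"Early detection leaves {early} days to plan")
print(f"Late detection leaves {late} days to plan")
```

Keep in mind that F is a moving point, so in practice the estimated failure date would be revised as new readings come in.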
Fig 2. You may see the P to F curve compartmentalized like this one (see sections below). However, the whole curve is the life expectancy of the machine and we monitor it using Condition Based Maintenance techniques.
Consultants (and I’m guilty) like to put labels on things and you may see:
- Design, Capability, Precision Maintenance.
- CBM, Predictive Maintenance.
- Preventive Maintenance.
- Run to Failure, Breakdown Maintenance.
For me the P to F interval curve starts when the machine starts. That means Design and Precision Maintenance are not on the curve, because they happen before startup. A small point, but including them takes away from the meaning of the interval.
We use predictive maintenance technologies in CBM: Vibration, Ultrasonic, Infrared, Oil Analysis, NDT (e.g. pipe wall thickness) and Operational Performance. They are all very good technologies, yet it is a combination of technologies that works best. As an example, vibration may give you the most information, yet ultrasound may give you the earliest warning on a high-speed bearing. And then there is oil analysis, which may be best for a low-speed gearbox. Your application dictates what’s best for you. A lot of time and effort has been put into having the best CBM program and buying the right technology.
This, I believe, led maintenance departments to put their focus on Condition Based Maintenance!
This, I think, is wrong because we still have failure. This means that CBM is no better than Predictive Maintenance. This doesn’t mean that I don’t recommend CBM; I do. To me it’s a must-have, but it does not improve the maintenance process because you still have machine failure.
Machine failures fall into three categories: Premature failure, Random failure and Age-related failure.
We want the last of these. Studies tell us that only 11% of machine assets fail because of age-related issues: they grow old and wear out. This means that 89% fail because of some other fault. This is a good thing because it gives us an opportunity to do something about them.
These numbers come from a very famous study by Nowlan and Heap (Google it!) that was commissioned by the US Department of Defense. It doesn’t mean these numbers are an exact reflection of every industry, but the study has stood the test of time and I believe it has led to the development of Reliability Centered Maintenance. But let’s say it’s wrong and let us double the amount they say is age related (full machine life expectancy). That would make it 22%, and 78% would be the amount of random failures. Even if we quadruple it, it’s only 44%, meaning random is at 56%, and we are still on the wrong side of the equation. The maintenance goal has to be to get the full life expectancy for all machine assets.
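The doubling and quadrupling above is simple arithmetic, and a short Python sketch makes it easy to try other what-if multipliers. The 11% figure is the one quoted in this article; the function name and the choice of multipliers are mine and are only for illustration.

```python
# Age-related share of failures quoted in the article (percent).
BASE_AGE_RELATED_PCT = 11


def failure_split(multiplier):
    """Return (age_related_pct, other_pct) after scaling the age-related share."""
    age_related = BASE_AGE_RELATED_PCT * multiplier
    if not 0 <= age_related <= 100:
        raise ValueError("scaled share must stay between 0 and 100 percent")
    return age_related, 100 - age_related


# As quoted, doubled, and quadrupled.
for m in (1, 2, 4):
    age, other = failure_split(m)
    print(f"x{m}: age-related {age}%, all other failures {other}%")
```

Even at four times the quoted figure, the non-age-related share is still the larger of the two.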
In order to get the full life expectancy for a machine unit I think you have to be assured of two things. One is the design of the unit which includes all related parts (not just the pump but the piping as well). The other is the installation.
Fig 3. The most important part of the life expectancy of a machine is the design and installation of the machine.
If you’re like me, and you believe that Condition Based Maintenance starts when the machine starts, then you understand that there is a section of the machine’s life that happens before. You could make an argument that it starts when you buy it because, as we all know, how we store it can have an effect. However, what is important at this stage is the design and installation of the machine. In most cases, we do not design the pump, gearbox or compressor, but we do size them so that they meet the required output (hopefully). We do quite often design the piping configuration or the bases, for example. All of this is very important, but the reality is that maintenance departments maintain machine assets that are already in place. So, although a new installation requiring design work is not often done, installation itself is.
Remove and Refit is done constantly. And the installation is something that you can control. In fact, it’s the installation that has the largest influence on the machine’s life. The goal is to create a stress-free environment for the machine to run in. No pipe strain, no distorted bases, no thermal expansion, no misalignment, etc.
Precision Maintenance was a term I first heard thirty years ago. It’s part of our M.A.A.D. training program (Measure, Analyse, Action and Documentation). It’s simple: it means working to a standard. Maintenance departments can set their own standards. However, all must agree on the standard and adhere to it. This is the only way to control the installation process. This is the way to stop random failure and get the full life expectancy for your machine assets. The issue is that we do not have a general machinery installation standard to work to. Yes, we can and do use information from other industry-specific sources such as the American Petroleum Institute (API) or from the OEM (both of these are guidelines); however, there is nothing for general industry as a whole. Well, this is about to change. The American National Standards Institute (ANSI) has just approved a new standard which is about to be published. I know this because I worked on it and will be writing about it shortly.
If you look at the life cycle of a machine, we need to know and manage the failure as best we can. But if we only or mainly focus on the failure, we will not improve the reliability of the machine. We cannot control the failure. What we can control is the installation and, done correctly, this will improve the process, giving the optimum life for the machine.
I sell laser shaft alignment tools as well as vibration instruments. If a customer were to buy a vibration monitoring tool before they bought a laser system, I would think their focus is on the effect of the issue, not the cause. What do you think?