Trends in cooling high-speed chips - automatic fan control

7 April 2004 Power Electronics / Power Management

Cooling fans are an important part of thermal management for high-powered chips and systems. Unfortunately, their use can sometimes raise a system's acoustic noise level to the point where it is objectionable.

By measuring temperature and adjusting fan speed accordingly, a system can keep fan speed and noise to a minimum when temperature is low, and increase fan speed under worst-case conditions to prevent damage. This article describes two techniques for automatically controlling a cooling fan's speed.

High-speed chips tend to run hot. As they get faster, they get hotter. New generation high-speed digital chips are made with smaller processes that allow the supply voltage to be reduced, which helps, but the number of transistors increases faster than the supply voltage decreases. Power levels, therefore, continue to rise.

As chip temperature increases, performance can suffer. Parameters shift, maximum operating frequencies decrease, and timing can fall out of specification. From the user's point of view, the product is no longer operating properly when this occurs. The first reason for cooling high-speed chips, therefore, is to maintain good performance for the longest possible operating time and over the widest possible range of environmental conditions. The maximum temperature at which a high-speed chip still meets its parametric specifications depends on the process and on how the chip is designed (ie, how 'close to the edge' the chip is operating), among other factors. Typical maximum die temperature values range from +90°C to +130°C.

Beyond the point where performance degradation begins, excessive die temperature causes catastrophic damage. The maximum die temperature limit is usually well over +120°C and depends on such factors as process, package, and duration of high-temperature conditions. High-speed chips are, therefore, cooled to avoid reaching such temperatures.

Techniques

A single cooling technique is rarely used. Instead, combinations of techniques are generally necessary to ensure high performance and continued reliability. Heatsinks, heat pipes, fans and clock throttling are commonly employed. The last two, fans and clock throttling, can help solve the heat problem, but introduce their own problems.

Fans can dramatically reduce the temperature of a high-speed chip, but they also generate considerable acoustic noise. This noise is annoying to some consumers and is also becoming a target of government agencies concerned about the long-term effects of noise in the workplace. Fan noise can be reduced significantly by varying the fan's speed based on temperature: the fan can turn slowly (and very quietly) when temperature is low, and can speed up as temperature increases.

Clock throttling - reducing clock speed to reduce power dissipation - works by reducing system performance. When throttling the clock, the system continues to function, but at a reduced speed. Clearly, in high-performance systems, throttling should be done only when it is absolutely necessary - that is, when the temperature reaches the point where functionality is about to be lost.

Temperature sensing

Controlling fan speed or clock throttling based on temperature requires that the temperature of the high-speed chip be measured first. This can be done by placing a temperature sensor close to the target chip, either directly next to it or, in some cases, under it or on the heatsink. The temperature measured this way tracks that of the high-speed chip, but can be significantly lower (by up to around 30°C), and the difference between measured temperature and die temperature increases as power dissipation increases. Therefore, the temperature of the circuit board or heatsink must be correlated to the die temperature of the high-speed chip.
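The correlation described above can be sketched with a simple model in which the offset between sensor and die grows linearly with power dissipation. This is only an illustration: the single thermal-resistance term and the numbers in the usage note are assumptions, and a real design would characterise the offset by measurement.

```python
def estimate_die_temp_c(sensor_temp_c: float,
                        power_w: float,
                        theta_c_per_w: float) -> float:
    """Estimate die temperature from a board- or heatsink-mounted sensor.

    Assumes (hypothetically) that the die-to-sensor temperature difference
    is proportional to power dissipation, with theta_c_per_w as the
    effective thermal resistance between the die and the sensor location.
    """
    if power_w < 0 or theta_c_per_w < 0:
        raise ValueError("power and thermal resistance must be non-negative")
    return sensor_temp_c + theta_c_per_w * power_w
```

For example, with an assumed 1.5°C/W between die and sensor, a 60°C sensor reading at 20 W of dissipation would correspond to an estimated die temperature of 90°C.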

A better alternative is possible with a number of high-speed chips. Many CPUs, graphics chips, FPGAs, and other high-speed ICs include a 'thermal diode', which is actually a diode-connected bipolar transistor, on the die. Using a remote-diode temperature sensor connected to this thermal diode, the temperature of the high-speed IC's die can be directly measured with excellent accuracy. This not only eliminates the large temperature gradients involved in measuring temperature outside the target IC's package, but it also eliminates the long thermal time constants, from several seconds to minutes, that cause delays in responding to die temperature changes.
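Remote-diode sensors generally work by forcing two different currents through the thermal diode and measuring the resulting change in base-emitter voltage, which is proportional to absolute temperature. The sketch below shows that relationship. The current ratio of 10 and the ideality factor of 1.008 are illustrative assumptions, not values taken from this article or from any particular device.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23     # Boltzmann constant
ELECTRON_CHARGE_C = 1.602176634e-19  # elementary charge


def delta_vbe_v(temp_c: float, current_ratio: float = 10.0,
                ideality: float = 1.008) -> float:
    """Change in V_BE when the diode current is switched by current_ratio."""
    kelvin = temp_c + 273.15
    return (ideality * BOLTZMANN_J_PER_K * kelvin
            * math.log(current_ratio) / ELECTRON_CHARGE_C)


def diode_temp_c(delta_v: float, current_ratio: float = 10.0,
                 ideality: float = 1.008) -> float:
    """Invert the relationship: recover temperature from a measured delta V_BE."""
    kelvin = (ELECTRON_CHARGE_C * delta_v
              / (ideality * BOLTZMANN_J_PER_K * math.log(current_ratio)))
    return kelvin - 273.15
```

Because the result depends on the ideality factor of the diode-connected transistor, a sensor is usually matched or trimmed to the target chip's thermal diode.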

The need for fan control forces the designer to make several key choices. The first choice is the method of adjusting the fan's speed. A common method of adjusting the speed of a brushless DC fan is to regulate the fan's power-supply voltage. This works well for power-supply voltages as low as about 40% of the nominal value. There is a drawback: if the power-supply voltage is varied using a linear pass device, efficiency is poor because the pass device dissipates the unused voltage as heat. Better efficiency can be obtained using a switch-mode power supply for the fan, but this increases cost and component count.
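The efficiency penalty of a linear pass device can be estimated with a short calculation. The sketch below assumes an ideal linear stage (no quiescent current) so that efficiency is simply the ratio of fan voltage to supply voltage. The 12 V supply and 100 mA fan current in the usage note are hypothetical.

```python
def linear_drive_efficiency(v_supply: float, v_fan: float) -> float:
    """Efficiency of an ideal linear pass stage: output voltage over input voltage."""
    if not 0.0 <= v_fan <= v_supply:
        raise ValueError("fan voltage must lie between 0 and the supply voltage")
    return v_fan / v_supply


def pass_device_loss_w(v_supply: float, v_fan: float, i_fan_a: float) -> float:
    """Power dissipated in the pass device: dropped voltage times fan current."""
    return (v_supply - v_fan) * i_fan_a
```

With a 12 V supply and the fan run at 40% of nominal (4.8 V), this model gives an efficiency of 40%. At an assumed fan current of 100 mA, the pass device would dissipate about 0.72 W.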

Speed control

Another popular fan-speed control technique is to power the fan with a low-frequency PWM signal, usually at about 30 Hz, whose duty cycle is varied to adjust the fan's speed. This is inexpensive because a single, small pass transistor can be used. It is efficient because the pass transistor is used as a switch. A disadvantage of this approach, however, is that it can make the fan somewhat noisier because of the pulsed nature of the power supply. The PWM waveform's fast edges cause the fan's mechanical structure to move (somewhat like a badly designed loudspeaker), which can easily be audible.
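As a rough illustration of the PWM drive described above, the following sketch converts a duty cycle into on and off times for one PWM period and into the average voltage applied to the fan. The function names are hypothetical. Real fan speed is not strictly proportional to average voltage, so this is only a first-order view.

```python
def pwm_timing(duty: float, frequency_hz: float = 30.0) -> tuple[float, float]:
    """Return (on_time_s, off_time_s) for one PWM period at the given duty cycle."""
    if not 0.0 <= duty <= 1.0:
        raise ValueError("duty must be between 0 and 1")
    period_s = 1.0 / frequency_hz
    on_s = duty * period_s
    return on_s, period_s - on_s


def average_voltage(duty: float, v_supply: float) -> float:
    """Average voltage seen by the fan when the supply is switched at this duty."""
    return duty * v_supply
```

At 30 Hz the period is about 33 ms, so a 50% duty cycle switches the fan on for roughly 16.7 ms in each cycle.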

Another fan-control design choice is whether the fan's speed is measured as part of the control scheme. In addition to power and ground, many fans are available with a third wire that provides a 'tachometer' signal to the fan-control circuitry. The tachometer output produces a specified number of pulses (two pulses, for example) for each revolution of the fan. Some fan-control circuits use this tachometer waveform as a feedback signal that allows the fan's voltage or PWM duty cycle to be adjusted to give a desired RPM. A simpler approach ignores any tachometer signal and simply adjusts the fan's drive to speed up or slow down with no speed feedback. Speed control using this method is less precise, but cost is lower and at least one feedback loop is removed, simplifying the control system.
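The tachometer-based approach can be sketched as two small steps: convert counted pulses into RPM, then nudge the drive level toward the desired speed. The integral-style correction and its gain are assumptions chosen for illustration, not the method used by any specific fan controller.

```python
def rpm_from_tach(pulse_count: int, window_s: float,
                  pulses_per_rev: int = 2) -> float:
    """Convert tachometer pulses counted over window_s seconds into RPM."""
    revolutions = pulse_count / pulses_per_rev
    return revolutions / window_s * 60.0


def adjust_drive(drive: float, target_rpm: float, measured_rpm: float,
                 gain: float = 1e-4) -> float:
    """One closed-loop step: move drive (0..1) in proportion to the RPM error."""
    new_drive = drive + gain * (target_rpm - measured_rpm)
    return min(1.0, max(0.0, new_drive))  # clamp to the valid drive range
```

With two pulses per revolution, 100 pulses counted in one second corresponds to 3000 RPM. Calling `adjust_drive` once per measurement window gradually removes the speed error; the open-loop alternative described above simply omits this step.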

In some systems, it is important to limit the rate at which fan speed changes. This is most critical when the system is in close proximity to users. Simply switching a fan on and off or changing speed immediately as temperature changes is acceptable in some environments. When users are nearby, however, sudden changes in fan noise are apparent and annoying. Limiting the rate of change of the fan's drive signal to an acceptable value (eg, 1% per second) ensures that the acoustic effects of fan control are minimised. The fan speed still changes, but it does so without attracting attention.
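A rate limit of this kind can be sketched as a slew limiter applied to the drive level on every control update. The function name and the update interval are hypothetical; the 1% per second figure is the example value given above.

```python
def slew_limit(current: float, target: float,
               max_rate_per_s: float, dt_s: float) -> float:
    """Move current toward target by at most max_rate_per_s * dt_s.

    Drive values are in percent; dt_s is the time since the last update.
    """
    max_step = max_rate_per_s * dt_s
    delta = target - current
    if delta > max_step:
        return current + max_step
    if delta < -max_step:
        return current - max_step
    return target  # close enough to reach the target in this step
```

With a limit of 1% per second and one update per second, a request to jump from 40% to 100% drive is spread over a full minute, rising by one percentage point per update.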

The fan-control profile is another important design variable. Typically, the fan is off below a specific threshold temperature and then begins to spin at a slow rate (for example, 40% of full speed) once the threshold is exceeded. As temperature increases, the fan's drive increases linearly with temperature until it reaches 100% drive. The best slope depends on system requirements. A steeper slope results in somewhat more consistent chip temperature, but fan speed varies more as power dissipation changes from one moment to the next. If highest performance is the goal, the starting temperature and the slope should be chosen so that the fan reaches full speed before the die temperature is high enough to initiate clock throttling.
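The profile just described can be expressed as a small piecewise-linear function. In this sketch the 50°C start temperature and 80°C full-speed temperature are hypothetical values chosen for illustration; only the 40% starting drive comes from the example above.

```python
def fan_drive_percent(temp_c: float,
                      start_temp_c: float = 50.0,
                      full_temp_c: float = 80.0,
                      min_drive: float = 40.0) -> float:
    """Map temperature to fan drive (percent) with a linear ramp.

    Below start_temp_c the fan is off. From start_temp_c the drive begins
    at min_drive and rises linearly to 100% at full_temp_c.
    """
    if temp_c < start_temp_c:
        return 0.0
    if temp_c >= full_temp_c:
        return 100.0
    fraction = (temp_c - start_temp_c) / (full_temp_c - start_temp_c)
    return min_drive + fraction * (100.0 - min_drive)
```

Moving `full_temp_c` closer to `start_temp_c` steepens the slope. To favour performance, `full_temp_c` would be set below the temperature at which clock throttling begins.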

Figure 1. Linear (DC-output) temperature sensor and automatic fan-speed controller. Fan speed is controlled automatically based on the temperature of the high-speed chip. Tachometer feedback from the fan allows the fan controller to regulate fan speed directly. System shutdown output prevents the high-speed chip from reaching destructive temperatures.

Implementation

Fan-control circuitry can be implemented in several ways. Remote temperature sensors with up to five sensing channels are available that can detect the die temperature of the high-speed chip and transmit temperature data to a microcontroller. Fan-speed regulators with multiple channels of fan tachometer monitoring can provide reliable control of fan RPM or supply voltage based on commands from an external microcontroller. For low cost and simple implementation, ICs are available with temperature sensing and automatic fan control included in a single package. Sensor/controllers also normally include overtemperature detection for clock throttling and system shutdown, thereby protecting the high-speed chips from catastrophic failure due to overheating.

Examples of two such ICs, one with DC drive and one with PWM drive, are shown in Figures 1 and 2. The IC in Figure 1 senses remote temperature and controls fan speed based on that temperature. It produces a DC supply voltage for the fan through an internal power transistor. Figure 2 shows an IC that performs a similar function, but drives the fan with a PWM waveform through an external pass transistor. Both include complete thermal fault monitoring with overtemperature outputs, which can be used to shut down the system if the high-speed chip gets too hot.

Figure 2. PWM-output temperature sensor and automatic fan-speed controller. Fan speed is controlled automatically based on temperature. Clock throttle and system shutdown outputs prevent a high-speed chip from reaching destructive temperatures. CRIT0 and CRIT1 pins can be strapped to supply or ground to select default shutdown-temperature thresholds, ensuring protection even when system software hangs.
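The overtemperature behaviour of such sensor/controllers can be sketched as a comparison against two thresholds, one for clock throttling and a higher one for system shutdown. The threshold values and names below are hypothetical. In the ICs described above this comparison is done in hardware, which is what allows protection to continue even if system software hangs.

```python
def thermal_state(die_temp_c: float,
                  throttle_temp_c: float = 95.0,
                  shutdown_temp_c: float = 110.0) -> str:
    """Classify die temperature as 'normal', 'throttle', or 'shutdown'.

    Both thresholds are illustrative assumptions; shutdown_temp_c must be
    higher than throttle_temp_c so that throttling is tried first.
    """
    if shutdown_temp_c <= throttle_temp_c:
        raise ValueError("shutdown threshold must exceed throttle threshold")
    if die_temp_c >= shutdown_temp_c:
        return "shutdown"
    if die_temp_c >= throttle_temp_c:
        return "throttle"
    return "normal"
```

A practical design would usually add hysteresis to each threshold so that the outputs do not chatter when temperature hovers near a limit.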

















© Technews Publishing (Pty) Ltd | All Rights Reserved