Tensilica Sogggggggund
Tensilica Sogggggggund
Tensilica Sogggggggund
All of these factors have resulted in a high demand for a flexible, high-performance,
low-power audio DSPs that add sound to an SOCs design with the least amount of
design effort and a small on-chip footprint. Tensilicas HiFi Audio DSP family was
carefully crafted to meet these requirements for the broadest possible range of
consumer products. The HiFi 2 and HiFi EP Audio DSPs are available as an audioextension packages for the Xtensa LX configurable processor core and the HiFi 2
DSP is incorporated into the pre-configured 330HiFi processor core.
www.tensilica.com
Page 2
www.tensilica.com
Page 3
1
2
3
4
5
6
HiFi 2 Operations
Loads and Stores
Single/Dual Multiply with 56-bit Accumulator
Scalar and 2-Way SIMD ALU Operations
Variable/Immediate Shifts
Convert/Round/Truncate/Saturate Operations
Huffman Encode/Decode and Bit-Stream Support
These HiFi instructions are teamed with two audio-specific register files: an 8-entry
file named P with 48 bits/entry (each entry can hold two 24-bit audio values) and a
4-entry file named Q with 56-bit entries. The 56-bit values in the Q register file are
generated from a set of instructions that control a dual multiplier/accumulator
Tensilica White Paper
www.tensilica.com
Page 4
(MAC). Each of the two pipelined multipliers can perform a 24x24-bit or a 32x16-bit
multiplication with a throughput of one multiplication per multiplier per cycle. The
56-bit results obtained from the two multipliers are accumulated in the Q registers.
(Note: the 32x16-bit operation mode for the multipliers is particularly helpful for
high-precision arithmetic.)
Group 1 includes operations that load data from and store data to the P and
Q register files. These operations support immediate and indexed
addressing modes with and without automatic address register updating.
Group 2 instructions drive the HiFi 2 Audio Engines dual MAC. Audio
codec software uses these instructions to perform audio-stream transforms
between the time and frequency domains and for windowing and frequencyband splitting, sample-rate conversion, and special audio effects such as
reverb and three-dimensional sound simulation.
Group 3 operations perform scalar arithmetic and Boolean functions on 56bit words stored in the Q register file as well as 2-way SIMD arithmetic and
Boolean functions on the 48-bit (paired 24-bit) words stored in the P
register file.
www.tensilica.com
Page 5
A detailed examination of all of the HiFi operations in the five operation groups is
available in the technical documentation but a quick look at the HiFi MAC operation
group illustrates the flexibility of the Xtensa LX architectures FLIX-format
instructions. Figure 1 illustrates the HiFi instruction format.
63
27
26
Operation 1
0
Operation 0
64-bit
Operation
24-bit
Operation
Slot 1
16-bit
Slot 0
The MAC instructions can include the following functions and very complex
instructions can be built from these MAC primitives:
The base Xtensa LX RISC instructions use operation Slot 0 and are either 16 or 24
bits wide. The HiFi instructions generally use Slot 0 for its load and store operations
and for a few audio-processing DSP instructions that access to the processors base
32-bit register file. HiFi uses Slot 1 for most DSP operations.
Xtensa LX processors with a HiFi option and the 330HiFi core modelessly handle
variable-width instruction streams. This feature allows the associated C/C++
compilers code generator to select and freely intermix various instruction sizes to
minimize the size of the compiled code. The compiler automatically selects the
smallest instruction that will perform the required operation, which results in very
compact code and avoids the code bloat generally associated with multi-operation
VLIW processors.
Figure 2 provides a simplified view the HiFi register files and data path, organized
visually to match the 2-slot FLIX instruction format shown in Figure 1. The 16- and
24-bit instructions share the base RISC processors execution hardware (registers
and pipeline) with operations in the lower operation slot (called Slot 0). The rest of
the datapath extensions occupy Slot 1.
www.tensilica.com
Page 6
The base RISC processor instructions and the HiFi Audio DSPs 16- and 24-bit
instructions only control resources assigned to Slot 0 while the 64-bit HIFi
instructions can control all the processor resources in both slots. For 64-bit
instructions, the operation residing in the lower part of the 64-bit instruction word
controls processor resources assigned to Slot 0 and the operation in the upper part of
the instruction word (labeled Operation 1 in Figure 1) controls resources assigned to
Slot 1. Its important to note that there are two operation slots but there is only one
processor, which keeps the programming model simple and allows the audioenhanced processor to be a good target for the C/C++ compiler.
www.tensilica.com
Page 7
Gate
Count
Area
(mm2)
299
88,569
0.98
289
100,860
1.12
284
101, 408
1.13
270
110,012
1.22
Based on TSMC 130nm LV process, Artisan library, includes MUL32 Xtensa LX configuration
option not used in the final HiFi design
www.tensilica.com
Page 8
www.tensilica.com
Page 9
Because each of these audio packages requires very little processing bandwidth, HiFi
can run these audio codecs at very low clock rates to save power. Alternatively, the
processor can run several of these audio packages simultaneously at somewhat
higher clock rates.
All of the codecs listed in Table 3 are written in C. In fact, a primary HiFi design
goal was to create an audio platform that could be programmed in C while delivering
the desired real-time performance at low processor clock rates. This approach opens
software development to a much larger programming audience than for other
vendors audio solutions (which must be programmed in assembly language to
achieve performance goals). Because many more programmers are familiar with C
than with assembly-language coding, SOC design teams can draw on the much larger
base of C/C++ programmers by using HiFi.
HiFi Audios Extensive Low Power Features
Through a combination of significantly lower per-MHz power consumption and
architectural optimization of the instruction set, HiFi delivers dramatic
improvements in energy efficiency that result in increased battery life (which results
in more playing time) for portable and wireless applications. Based on TSMCs
65nm LP process and a minimal HiFi 2 configuration including memory, dynamic
and static power dissipation can be as low as 66 W/MHz and 69 W respectively.
Total power dissipation is only 0.45 mW while decoding a typical MP3 file at 5.7
MHz.
Tensilicas Xtensa LX configurable processor core and Tensilicas 330HiFi Audio
DSP, which is based on the Xtensa LX core, have many features that facilitate lowpower operation including functional clock gating and a variety of power-down and
sleep modes. Lower power and energy consumption is a key reason for using
configurable processor cores and instruction-set extensions to run audio codecs. By
adding appropriately tailored instructions to the processors ISA (instruction-set
architecture), the configured audio processor executes the target application code in
many fewer cycles. As a result, the processor core can execute the codec at a greatly
reduced clock frequency, which in turn cuts both power dissipation and energy
consumption.
Xtensa processor cores including the 330HiFi Audio DSP have two levels of clock
gating. The first level of clock gating is based on global conditions. For instance, the
WAITI instruction allows an Xtensa processor to enter a sleep mode that turns off
the clocks to nearly all of the processors internal registers. An interrupt wakes the
processor from sleep mode. In addition, the processors RunStall signal can still be
used to save power by allowing external logic to stall the processor pipeline and turn
off the clock to many of the processors registers.
The processors second level of clock gating is functional clock gating. Xtensa
processor cores including the 330HiFi Audio Engine contain hundreds of functional
blocks, identified through trillions of simulation cycles exercising all of the
Tensilica White Paper
www.tensilica.com
Page 10
Performance to Spare
For high-performance applications, processor cores based on Tensilicas Xtensa LX
processor core and the HiFi 2 Audio DSP can attain clock rates exceeding 320 MHz
in TSMCs 90nm G process, leaving plenty of headroom for other control, media,
and signal-processing tasks. The efficient Xtensa HiFi 2 Audio Engine architecture
requires only a small fraction of this available processor bandwidth to perform the
audio decoding and encoding functions.
www.tensilica.com
Page 11
to the systems output DACs via an output queue. Because of this, FIFO-queue
interfaces are good I/O choices in system designs that incorporate audio because the
queue interfaces separate the continuous flow of audio data from other bus traffic,
freeing valuable system-bus bandwidth at a very low hardware cost.
Conclusion
Most ASIC and SOC design teams working with on-chip audio simply want to add
audio as a drop-in component. Audio is certainly one of the SOCs important
features but design teams generally need to add value by spending development time
and resource on other product-specific features. Digital audio has become
sufficiently standardized so that it can now be added to the ASIC or SOC design as
an off-the-shelf component.
However, the only way to add audio as a component is to select a complete, readyto-use audio solution. There are many characteristics used to measure the
completeness of an audio solution. Tensilicas HiFi Audio DSPs and the 330HiFi
audio processor core are complete solutions that offer:
Easy design in. Many factors determine how easy it will be to design a
vendors audio solution into an ASIC or SOC. Among these are the
flexibility of the core, the interface flexibility of the audio block, simulation
support, and programming support.
www.tensilica.com
Page 12
Note: If you would like help adding digital audio to your next ASIC or SOC design,
contact Tensilica for a consultation. You might find our Audio Reference Design
application note valuable. See it on our web site at www.tensilica.com.
www.tensilica.com