dlstft

Deep learning short-time Fourier transform

Since R2021a

Syntax

y = dlstft(x)

y = dlstft(x,fs)

y = dlstft(x,ts)

y = dlstft(___,Name,Value)

[y,f,t] = dlstft(___)

Description

y = dlstft(x) returns the deep learning short-time Fourier transform (STFT) of x. dlstft requires Deep Learning Toolbox™.

y = dlstft(x,fs) returns the deep learning STFT assuming that x was sampled at the rate fs.

y = dlstft(x,ts) returns the deep learning STFT assuming that x was sampled with sample time ts.

y = dlstft(___,Name,Value) specifies additional options using name-value arguments. Options include the spectral window and the FFT length. These arguments can be added to any of the previous input syntaxes. For example, 'DataFormat','CBT' specifies the data format of x as CBT.

[y,f,t] = dlstft(___) returns the frequencies f and times t at which the deep learning STFT is computed.

Examples

Deep Learning Short-Time Fourier Transform of Chirp

Generate a signal sampled at 600 Hz for 2 seconds. The signal consists of a chirp with sinusoidally varying frequency content.

fs = 6e2;
t = 0:1/fs:2;
x = vco(sin(2*pi*t),[0.1 0.4]*fs,fs);

Store the signal in an unformatted deep learning array. Compute the short-time Fourier transform of the signal. Input the sample time as a duration scalar. (Alternatively, input the sample rate as a numeric scalar.) Specify that the input array is in 'CTB' format.

dlx = dlarray(x);

[y,f,t] = dlstft(dlx,seconds(1/fs),'DataFormat','CTB');

Convert the outputs to numeric arrays. Compute the magnitude of the short-time Fourier transform and display it as a waterfall plot.

y = extractdata(y);
f = extractdata(f);
t = seconds(t);

waterfall(f,t,squeeze(abs(y))')

ax = gca;
ax.XDir = 'reverse';
view(30,45)

ylabel('Time (s)')
xlabel('Frequency (Hz)')
zlabel('Magnitude')

Figure contains an axes object. The axes object with xlabel Frequency (Hz), ylabel Time (s) contains an object of type patch.

Deep Learning Short-Time Fourier Transform of Sinusoid

Generate a 3-by-160(-by-1) array containing one batch of a three-channel, 160-sample sinusoidal signal. The normalized sinusoid frequencies are π/4 rad/sample, π/2 rad/sample, and 3π/4 rad/sample. Save the signal as a dlarray, specifying the dimensions in order. dlarray permutes the array dimensions to the 'CBT' shape expected by a deep learning network. Display the array dimension sizes.

x = dlarray(cos(pi.*(1:3)'/4*(0:159)),'CTB');
[nchan,nbtch,nsamp] = size(x)

nchan = 
3

nbtch = 
1

nsamp = 
160

Compute the deep learning short-time Fourier transform of the signal. Specify a 64-sample rectangular window and an FFT length of 1024.

[y,f,t] = dlstft(x,'Window',rectwin(64),'FFTLength',1024);

dlstft computes the transform along the 'T' dimension. The output array is in 'SCBT' format. The 'S' dimension corresponds to frequency in the short-time Fourier transform. Extract the data from the deep learning array.

y = extractdata(y);
f = extractdata(f);
t = extractdata(t);

Compute the magnitude of the short-time Fourier transform. Plot the magnitude separately for each channel in a waterfall plot.

z = abs(squeeze(y));

for kj = 1:nchan
    subplot(nchan,1,kj)
    waterfall(f/pi,t,squeeze(z(:,kj,:))')
    view(30,45)
end

xlabel('Frequency (\times\pi rad/sample)')
ylabel('Samples')

Figure contains 3 axes objects. Axes object 1 contains an object of type patch. Axes object 2 contains an object of type patch. Axes object 3 with xlabel Frequency (\times\pi rad/sample), ylabel Samples contains an object of type patch.

Input Arguments

`x` — Input array
`dlarray` object | numeric array

Input array, specified as an unformatted dlarray (Deep Learning Toolbox) object, a formatted dlarray object in 'CBT' format, or a numeric array. If x is an unformatted dlarray or a numeric array, you must specify the 'DataFormat' as some permutation of 'CBT'.

Example: dlarray(cos(pi./[4;2]*(0:159)),'CTB') and dlarray(cos(pi./[4;2]*(0:159))','TCB') both specify one batch observation of a two-channel sinusoid in the 'CBT' format.

`fs` — Sample rate
2π (default) | positive numeric scalar

Sample rate, specified as a positive numeric scalar.

`ts` — Sample time
`duration` scalar

Sample time, specified as a duration scalar. Specifying ts is equivalent to setting a sample rate f_s = 1/ts.

Example: seconds(1) is a duration scalar representing a 1-second time difference between consecutive signal samples.
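The relationship between ts and f_s can be checked with base MATLAB alone. The following is a minimal sketch; the variable names and the 600 Hz value (borrowed from the chirp example) are illustrative choices, not part of the dlstft interface.

```matlab
% Convert between a duration sample time and a numeric sample rate.
fs = 600;                  % sample rate in Hz (illustrative value)
ts = seconds(1/fs);        % equivalent sample time as a duration scalar

% seconds() applied to a duration returns the numeric value in seconds,
% so the sample rate can be recovered as 1/ts.
fsRecovered = 1/seconds(ts);
```

Under this equivalence, passing ts to dlstft is expected to describe the same sampling as passing fs. The difference lies in the type of the returned t output, as described under Output Arguments.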

Name-Value Arguments

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Before R2021a, use commas to separate each name and value, and enclose Name in quotes.

Example: 'Window',hamming(100),'OverlapLength',50,'FFTLength',128 windows the data using a 100-sample Hamming window, with 50 samples of overlap between adjoining segments and a 128-point FFT.

`DataFormat` — Input data format
character vector | string scalar

Input data format, specified as a character vector or string scalar. This argument is valid only if x is unformatted.

Each character in this argument must be one of these labels:

C — Channel
B — Batch observations
T — Time

The dlstft function accepts any permutation of 'CBT'. You can specify at most one of each of the C, B, and T labels.

Each element of the argument labels the matching dimension of x. If the argument is not in the listed order ('C' followed by 'B' and so on), then dlstft implicitly permutes both the argument and the data to match the order, but without changing how the data is stored.

Example: 'CBT'
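As a sketch of how the labels behave, the following code labels a two-channel signal in two ways: once through a formatted dlarray and once through 'DataFormat' on a plain numeric array. It assumes Deep Learning Toolbox is installed; the variable names are illustrative.

```matlab
% Two-channel sinusoid stored as a 2-by-160 numeric array
% (rows are channels, columns are time samples).
x = cos(pi./[4;2]*(0:159));

% Option 1: attach labels when creating the dlarray. The labels are given in
% storage order ('C', then 'T', then a trailing singleton 'B').
dlx = dlarray(x,'CTB');
fmt = dims(dlx);           % labels after dlarray reorders the dimensions
sz  = size(dlx);           % dimension sizes in the reordered layout

% Option 2: leave the data unformatted and describe its layout with 'DataFormat'.
y1 = dlstft(dlx);
y2 = dlstft(x,'DataFormat','CTB');
```

Consistent with the sinusoid example above, fmt is expected to be 'CBT' and sz to be [2 1 160]. Both calls describe the same data, so y1 and y2 should hold the same values, with y1 formatted and y2 unformatted.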

`Window` — Spectral window
`hann(128,'periodic')` (default) | vector

Spectral window, specified as a vector. If you do not specify the window or specify it as empty, the function uses a periodic Hann window of length 128. The length of 'Window' must be greater than or equal to 2.

For a list of available windows, see Windows.

Example: hann(N+1) and (1-cos(2*pi*(0:N)'/N))/2 both specify a Hann window of length N + 1.

Data Types: double | single
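The closed-form expression in the example can be compared numerically with hann. This is a minimal sketch that assumes Signal Processing Toolbox is available for hann; the second comparison uses the periodic form that corresponds to the documented default window, and all variable names are illustrative.

```matlab
% Symmetric Hann window of length N+1 versus its closed form.
N  = 127;
w1 = hann(N+1);
w2 = (1 - cos(2*pi*(0:N)'/N))/2;
maxDiffSymmetric = max(abs(w1 - w2));

% Periodic Hann window of length 128 (the documented default). Its closed form
% divides by the window length itself rather than by length minus one.
wDefault = hann(128,'periodic');
wClosed  = (1 - cos(2*pi*(0:127)'/128))/2;
maxDiffPeriodic = max(abs(wDefault - wClosed));
```

Both differences should be at the level of floating-point round-off.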

`OverlapLength` — Number of overlapped samples
`75%` of window length (default) | nonnegative integer

Number of overlapped samples, specified as a nonnegative integer smaller than the length of 'Window'. If you omit 'OverlapLength' or specify it as empty, it is set to the largest integer less than or equal to 75% of the window length, which is 96 samples for the default Hann window.

Data Types: double | single
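The default overlap and the resulting hop size can be worked out by hand. The sketch below assumes that the default is obtained by rounding 75% of the window length down with floor, which reproduces the 96 samples quoted for the default window; the variable names are illustrative.

```matlab
% Default overlap and hop size for the default 128-sample window.
M = 128;                   % window length
L = floor(0.75*M);         % assumed default overlap: 96 samples
R = M - L;                 % hop between adjoining segments: 32 samples

% The same rule applied to the 64-sample window of the sinusoid example.
M64 = 64;
L64 = floor(0.75*M64);     % 48 samples
R64 = M64 - L64;           % 16 samples
```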

`FFTLength` — Number of discrete Fourier transform (DFT) points
`128` (default) | positive integer

Number of DFT points, specified as a positive integer. The value must be greater than or equal to the window length. If the length of the input signal is less than the DFT length, the data is padded with zeros.

Data Types: double | single

Output Arguments

`y` — Short-time Fourier transform
formatted `dlarray` object | unformatted `dlarray` object

Short-time Fourier transform, returned as a formatted or unformatted dlarray (Deep Learning Toolbox) object.

If x is a formatted dlarray, then y is an 'SCBT' formatted dlarray object. The 'S' dimension corresponds to frequency in the short-time Fourier transform.
If x is an unformatted dlarray or a numeric array, then y is an unformatted dlarray object. The dimension order in y is 'SCBT'.

If no time information is specified, then the STFT is computed over the Nyquist range [0, π] if 'FFTLength' is even and over [0, π) if 'FFTLength' is odd. If you specify time information, then the intervals are [0, f_s/2] and [0, f_s/2), respectively, where f_s is the effective sample rate.

`f` — Frequencies
`dlarray` object

Frequencies at which the deep learning STFT is computed, returned as a dlarray object.

If the input array does not contain time information, then the frequencies are in normalized units of rad/sample.
If the input array contains time information, then f contains frequencies expressed in Hz.
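To illustrate the two unit conventions, the following sketch builds the standard one-sided DFT frequency grid in rad/sample and in Hz. It is assumed, not confirmed by this page, that the values returned in f follow this grid; the code does not call dlstft, and the variable names are illustrative.

```matlab
% One-sided frequency grid for an even FFT length.
nfft  = 128;                         % FFT length (documented default)
nbins = floor(nfft/2) + 1;           % bins 0 .. nfft/2 for even nfft

fNorm = 2*pi*(0:nbins-1)'/nfft;      % normalized frequencies, rad/sample: [0, pi]

fs  = 600;                           % sample rate in Hz (illustrative)
fHz = fs*(0:nbins-1)'/nfft;          % frequencies in Hz: [0, fs/2]

% For an odd FFT length the same expressions stop short of the Nyquist
% frequency, which matches the half-open intervals [0, pi) and [0, fs/2).
nfftOdd  = 127;
nbinsOdd = floor(nfftOdd/2) + 1;
fNormOdd = 2*pi*(0:nbinsOdd-1)'/nfftOdd;
```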

`t` — Times
`dlarray` object | `duration` array

Times at which the deep learning STFT is computed, returned as a dlarray object or a duration array.

If you do not specify time information, then t contains sample numbers.
If you specify a sample rate, then t contains time values in seconds.
If you specify a sample time, then t is a duration array with the same time format as ts.

More About

Short-Time Fourier Transform

The short-time Fourier transform (STFT) is used to analyze how the frequency content of a nonstationary signal changes over time. The magnitude squared of the STFT is known as the spectrogram time-frequency representation of the signal. For more information about the spectrogram and how to compute it using Signal Processing Toolbox™ functions, see Spectrogram Computation with Signal Processing Toolbox.

The STFT of a signal is computed by sliding an analysis window g(n) of length M over the signal and calculating the discrete Fourier transform (DFT) of each segment of windowed data. The window hops over the original signal at intervals of R samples, equivalent to L = M – R samples of overlap between adjoining segments. Most window functions taper off at the edges to avoid spectral ringing. The DFT of each windowed segment is added to a complex-valued matrix that contains the magnitude and phase for each point in time and frequency. The STFT matrix has

$k = \left\lfloor \frac{N_x - L}{M - L} \right\rfloor$

columns, where N_x is the length of the signal x(n) and the ⌊⌋ symbols denote the floor function. The number of rows in the matrix equals N_DFT, the number of DFT points, for centered and two-sided transforms and an odd number close to N_DFT/2 for one-sided transforms of real-valued signals.
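As a worked instance of the column formula, the sketch below evaluates k for the two examples on this page. The overlap values assume the default of 75% of the window length, and the row counts assume a one-sided transform with floor(N_DFT/2) + 1 rows, which is an assumption consistent with the "odd number close to N_DFT/2" description for even N_DFT. The variable names are illustrative.

```matlab
% Chirp example: 1201 samples, default 128-sample window, 96-sample overlap.
NxChirp = 1201;  Mchirp = 128;  Lchirp = 96;
kChirp  = floor((NxChirp - Lchirp)/(Mchirp - Lchirp));   % floor(1105/32) = 34

% Sinusoid example: 160 samples, 64-sample window, assumed 48-sample overlap.
NxSine = 160;  Msine = 64;  Lsine = 48;
kSine  = floor((NxSine - Lsine)/(Msine - Lsine));        % floor(112/16) = 7

% Assumed one-sided row counts for the FFT lengths used in the two examples.
rowsChirp = floor(128/2) + 1;     % 65
rowsSine  = floor(1024/2) + 1;    % 513
```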

The mth column of the STFT matrix $X(f) = \begin{bmatrix} X_1(f) & X_2(f) & X_3(f) & \cdots & X_k(f) \end{bmatrix}$ contains the DFT of the windowed data centered about time mR:

$X_m(f) = \sum_{n=-\infty}^{\infty} x(n)\, g(n - mR)\, e^{-j 2 \pi f n}.$
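The sliding-window procedure described above can be sketched in base MATLAB without any toolbox. This is an illustration of the definition, not the implementation used by dlstft. It applies fft to each segment using the segment's local sample index, so the phase of each column can differ from the formula above by a segment-dependent factor while the magnitudes agree. The signal, window, and parameter values are illustrative.

```matlab
% Manual one-sided STFT of a real signal (illustrative sketch only).
Nx = 160;                               % signal length
n  = (0:Nx-1)';
x  = cos(pi/4*n);                       % sinusoid at pi/4 rad/sample

M    = 64;                              % window length
L    = 48;                              % overlap length
R    = M - L;                           % hop size
NDFT = 128;                             % number of DFT points (>= M)
g    = 0.5*(1 - cos(2*pi*(0:M-1)'/M));  % periodic Hann window, closed form

k = floor((Nx - L)/(M - L));            % number of segments (columns)
X = zeros(NDFT/2 + 1, k);               % one-sided: keep bins 0 .. NDFT/2
for m = 1:k
    idx = (m-1)*R + (1:M);              % samples covered by segment m
    Xm  = fft(x(idx).*g, NDFT);         % fft zero-pads the segment to NDFT points
    X(:,m) = Xm(1:NDFT/2 + 1);
end

f = 2*pi*(0:NDFT/2)'/NDFT;              % bin frequencies in rad/sample
[~,ipk] = max(abs(X(:,1)));             % strongest bin in the first segment
fpk = f(ipk);
```

Because π/4 rad/sample falls exactly on a DFT bin for this FFT length, fpk is expected to equal π/4, and X has 65 rows and 7 columns.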

Extended Capabilities

C/C++ Code Generation
Generate C and C++ code using MATLAB® Coder™. (since R2025a)

Usage notes and limitations:

The DataFormat name-value argument must be a compile-time constant.

GPU Code Generation
Generate CUDA® code for NVIDIA® GPUs using GPU Coder™. (since R2025a)

Refer to the usage notes and limitations in the C/C++ Code Generation section. The same limitations apply to GPU code generation.

GPU Arrays
Accelerate code by running on a graphics processing unit (GPU) using Parallel Computing Toolbox™.

This function fully supports GPU arrays. For more information, see Run MATLAB Functions on a GPU (Parallel Computing Toolbox).
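A minimal sketch of GPU usage follows. It assumes Deep Learning Toolbox and Parallel Computing Toolbox are installed, and it falls back to the CPU when canUseGPU reports that no supported GPU is available. The signal and variable names are illustrative.

```matlab
% Move the data to the GPU when one is available, then compute the STFT.
x = cos(pi/4*(0:159));     % single-channel, 160-sample sinusoid

if canUseGPU
    x = gpuArray(x);       % the underlying data now lives on the GPU
end

dlx = dlarray(x,'CTB');    % label the dimensions: channel, time, batch
y   = dlstft(dlx);         % formatted input, so y is expected in 'SCBT' format
fmt = dims(y);

yHost = gather(extractdata(y));   % bring the result back to host memory
```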

Version History

Introduced in R2021a

R2025a: C/C++ and GPU Code Generation

The dlstft function supports:

C/C++ code generation. You must have MATLAB® Coder™ to generate C/C++ code.
Code generation for NVIDIA® GPUs. You must have GPU Coder™ to generate GPU code.

R2023b: `dlstft` combines real and imaginary parts of transform into one output argument

Starting in R2023b, dlstft combines the real and imaginary parts of the STFT into a single output argument. Previously, the function returned the real part of the transform as the first output argument and the imaginary part as the second. Starting in this release, the second and third output arguments are the frequencies and times at which the STFT is computed, respectively. Function calls with four output arguments result in an error.

Original Code in R2023a or Earlier	Result	Updated Code in R2023b
`[yr,yi] = dlstft(x);`	Runs, but the second output argument has a different interpretation	`y = dlstft(x); yr = real(y); yi = imag(y);`
`[yr,yi,f,t] = dlstft(x);`	Errors	`[y,f,t] = dlstft(x); yr = real(y); yi = imag(y);`
