Spectrograms

Module by: Don Johnson.

Summary: Spectrograms visually represent the speech signal, and the calculation of the spectrogram is briefly explained.

Now that we know how to acquire analog signals for digital processing (pre-filtering, sampling, and A/D conversion) and how to compute spectra of discrete-time signals (using the FFT algorithm), let's put these components together to learn how the spectrogram shown in Figure 1, which is used to analyze speech, is calculated. The speech was sampled at a rate of 11.025 kHz and passed through a 16-bit A/D converter.

Point of interest:

Music compact discs (CDs) encode their signals at a sampling rate of 44.1 kHz. We'll learn the rationale for this number later. The 11.025 kHz sampling rate for the speech is 1/4 of the CD sampling rate, and was the lowest available sampling rate commensurate with speech signal bandwidths available on my computer.

Exercise 1

Looking at Figure 1, the signal lasted a little over 1.2 seconds. How long was the sampled signal (in terms of samples)? What was the data rate during the sampling process in bps (bits per second)? Assuming the computer storage is organized in terms of bytes (8-bit quantities), how many bytes of computer memory does the speech consume?

Solution

The number of samples equals $1.2 \times 11025 = 13230$. The data rate is $11025 \times 16 = 176.4$ kbps. The storage required would be $26460$ bytes.
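The arithmetic in this solution can be checked with a few lines of Python. This is only a sketch: the variable names are illustrative, and the duration is taken as exactly 1.2 s, as the solution does.

```python
# Values taken from the text; the variable names are illustrative.
sampling_rate_hz = 11025      # 11.025 kHz
bits_per_sample = 16          # 16-bit A/D converter
duration_s = 1.2              # approximate duration read from Figure 1

num_samples = round(duration_s * sampling_rate_hz)
data_rate_bps = sampling_rate_hz * bits_per_sample
storage_bytes = num_samples * bits_per_sample // 8   # 8 bits per byte

print(num_samples)     # 13230
print(data_rate_bps)   # 176400, i.e. 176.4 kbps
print(storage_bytes)   # 26460
```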

**Figure 1**
Speech Spectrogram

The resulting discrete-time signal, shown in the bottom of Figure 1, clearly changes its character with time. To display these spectral changes, the long signal was sectioned into frames: comparatively short, contiguous groups of samples. Conceptually, a Fourier transform of each frame is calculated using the FFT. Each frame is not so long that significant signal variations are retained within a frame, but not so short that we lose the signal's spectral character. Roughly speaking, the speech signal's spectrum is evaluated over successive time segments and stacked side by side so that the $x$-axis corresponds to time and the $y$-axis to frequency, with color indicating the spectral amplitude.
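As a rough illustration of this framing step, the sketch below sections a signal into contiguous frames and computes one FFT per frame using NumPy. The function name `frame_spectra` and the synthetic two-tone test signal are assumptions made for the example; the original speech recording is not available here.

```python
import numpy as np

def frame_spectra(signal, frame_length):
    """Split signal into contiguous frames and return each frame's FFT magnitude."""
    num_frames = len(signal) // frame_length          # drop any partial last frame
    frames = np.reshape(signal[:num_frames * frame_length],
                        (num_frames, frame_length))
    # One FFT per row; keep only the non-negative frequencies of a real signal.
    return np.abs(np.fft.rfft(frames, axis=1))

# Synthetic stand-in for speech: a tone whose frequency changes halfway through.
fs = 11025
t = np.arange(fs) / fs
x = np.where(t < 0.5, np.sin(2 * np.pi * 500 * t), np.sin(2 * np.pi * 2000 * t))

spectra = frame_spectra(x, 256)
# Rows index time (frames) and columns index frequency; transpose before
# plotting to put time on the x-axis and frequency on the y-axis.
print(spectra.shape)   # (43, 129)
```

Each row is the spectrum of one frame, so the change in the tone's frequency shows up as a shift in the peak column between early and late rows.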

An important detail emerges when we examine each framed signal (Figure 2).

**Figure 2:** The top waveform is a segment 1024 samples long taken from the beginning of the "Rice University" phrase. Computing Figure 1 involved creating frames, here demarked by the vertical lines, that were 256 samples long and finding the spectrum of each. If a rectangular window is applied (corresponding to extracting a frame from the signal), oscillations appear in the spectrum (middle of bottom row). Applying a Hanning window gracefully tapers the signal toward frame edges, thereby yielding a more accurate computation of the signal's spectrum at that moment of time.
Spectrogram Hanning vs. Rectangular

At the frame's edges, the signal may change very abruptly, a feature not present in the original signal. A transform of such a segment reveals a curious oscillation in the spectrum, an artifact directly related to this sharp amplitude change. A better way to frame signals for spectrograms is to apply a window: Shape the signal values within a frame so that the signal decays gracefully as it nears the edges. This shaping is accomplished by multiplying the framed signal by the sequence $w(n)$. In sectioning the signal, we essentially applied a rectangular window: $w(n) = 1$, $0 \le n \le N-1$. A much more graceful window is the Hanning window; it has the cosine shape $w(n) = \frac{1}{2}\left(1 - \cos\left(\frac{2\pi n}{N}\right)\right)$. As shown in Figure 2, this shaping greatly reduces spurious oscillations in each frame's spectrum. Considering the spectrum of the Hanning windowed frame, we find that the oscillations resulting from applying the rectangular window obscured a formant (the one located at a little more than half the Nyquist frequency).
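The two windows described above can be written out directly. The sketch below is a minimal illustration using only the standard library; the function names are made up for this example, and the Hanning window uses the denominator N as in the text (library routines may define it slightly differently).

```python
import math

def rectangular_window(N):
    """w(n) = 1 for 0 <= n <= N-1."""
    return [1.0] * N

def hanning_window(N):
    """w(n) = (1/2)(1 - cos(2*pi*n/N)) for 0 <= n <= N-1, as in the text."""
    return [0.5 * (1.0 - math.cos(2.0 * math.pi * n / N)) for n in range(N)]

def apply_window(frame, window):
    """Shape a frame by multiplying it sample by sample with the window."""
    return [s * w for s, w in zip(frame, window)]

N = 256
w = hanning_window(N)
print(w[0], w[N // 2])   # 0.0 at the frame's start, 1.0 at its center
```

Multiplying a frame by `hanning_window(N)` tapers the samples toward zero near the frame's start and end, which removes the abrupt edge that the rectangular window leaves in place.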

Exercise 2

What might be the source of these oscillations? To gain some insight, what is the length-$2N$ discrete Fourier transform of a length-$N$ pulse? The pulse emulates the rectangular window, and certainly has edges. Compare your answer with the length-$2N$ transform of a length-$N$ Hanning window.

Solution

The oscillations are due to the boxcar window's Fourier transform, which equals the sinc function.
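The comparison the exercise asks for can also be made numerically. The sketch below assumes N = 256 and uses NumPy's FFT. For the pulse, the computed magnitude should match the closed form $\left|\sin(\pi k/2) / \sin(\pi k/(2N))\right|$, the discrete-time counterpart of the sinc function. The index range used to measure sidelobes is an arbitrary choice for this example.

```python
import numpy as np

N = 256
n = np.arange(N)
pulse = np.ones(N)                                   # rectangular (boxcar) window
hanning = 0.5 * (1 - np.cos(2 * np.pi * n / N))      # window defined in the text

# Length-2N transforms; np.fft.fft zero-pads the input to the requested length.
P = np.abs(np.fft.fft(pulse, 2 * N))
H = np.abs(np.fft.fft(hanning, 2 * N))

# Compare the largest value away from the main lobe, relative to the peak at k = 0.
k = np.arange(8, N)   # indices chosen to lie outside either window's main lobe
pulse_sidelobe_db = 20 * np.log10(P[k].max() / P[0])
hanning_sidelobe_db = 20 * np.log10(H[k].max() / H[0])
print(pulse_sidelobe_db, hanning_sidelobe_db)
```

The Hanning window's sidelobes come out much lower than the pulse's, which is consistent with the reduced oscillations seen in Figure 2.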

**Figure 3:** In comparison with the original speech segment shown in the upper plot, the non-overlapped Hanning windowed version shown below it is very ragged. Clearly, spectral information extracted from the bottom plot could well miss important features present in the original.
Non-overlapping windows

If we examine the windowed signal sections in sequence to see windowing's effect on signal amplitude, we find that we have managed to amplitude-modulate the signal with the periodically repeated window (Figure 3). To alleviate this problem, frames are overlapped (typically by half a frame duration). This solution requires more Fourier transform calculations than needed by rectangular windowing, but the spectra are much better behaved and spectral changes are much better captured.
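One way to see why half-frame overlap helps is to add up the windows themselves. In the sketch below, which assumes the Hanning window defined earlier with N = 256, windows placed end to end leave an envelope that repeatedly dips to zero, whereas windows placed every half frame sum to a constant. The helper `window_envelope` is hypothetical and exists only for this illustration.

```python
import math

N = 256                      # frame length used in the text
hop = N // 2                 # half-frame overlap
w = [0.5 * (1.0 - math.cos(2.0 * math.pi * n / N)) for n in range(N)]

def window_envelope(total_length, step):
    """Sum copies of the window placed every `step` samples."""
    envelope = [0.0] * total_length
    for start in range(0, total_length - N + 1, step):
        for n in range(N):
            envelope[start + n] += w[n]
    return envelope

no_overlap = window_envelope(4 * N, N)
half_overlap = window_envelope(4 * N, hop)

# Ignore the first and last half frame, where only one window contributes.
interior = slice(hop, 4 * N - hop)
print(min(no_overlap[interior]), max(no_overlap[interior]))      # dips to 0, peaks at 1
print(min(half_overlap[interior]), max(half_overlap[interior]))  # both approximately 1
```

With half-frame overlap, every interior sample is weighted equally across the frames that contain it, so no part of the signal is systematically suppressed.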

The speech signal, such as that shown in the speech spectrogram, is sectioned into overlapping, equal-length frames, with a Hanning window applied to each frame. The spectrum of each frame is calculated and displayed in the spectrogram, with frequency extending vertically, window time location running horizontally, and spectral magnitude color-coded. Figure 4 illustrates these computations.

**Figure 4:** The original speech segment and the sequence of overlapping Hanning windows applied to it are shown in the upper portion. Frames were 256 samples long and a Hanning window was applied with a half-frame overlap. A length-512 FFT of each frame was computed, with the magnitude of the first 257 FFT values displayed vertically, with spectral amplitude values color-coded.
Overlapping windows for computing spectrograms
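The computation summarized in Figure 4 can be sketched as follows, under stated assumptions: frames of 256 samples, half-frame overlap, the Hanning window from the text, and a length-512 FFT of which the first 257 values are kept. The function name `spectrogram` and the synthetic 1 kHz test tone are illustrative stand-ins, not the author's code or data.

```python
import numpy as np

def spectrogram(signal, frame_length=256, fft_length=512):
    """Magnitude spectrogram with half-overlapped Hanning-windowed frames.

    Returns an array of shape (fft_length // 2 + 1, num_frames): frequency
    runs along the rows (vertical axis), time along the columns.
    """
    hop = frame_length // 2
    n = np.arange(frame_length)
    window = 0.5 * (1 - np.cos(2 * np.pi * n / frame_length))
    starts = range(0, len(signal) - frame_length + 1, hop)
    frames = np.array([signal[s:s + frame_length] * window for s in starts])
    # rfft zero-pads each frame to fft_length and returns the first
    # fft_length // 2 + 1 values (257 for a length-512 transform).
    return np.abs(np.fft.rfft(frames, n=fft_length, axis=1)).T

# Synthetic test signal standing in for the speech recording.
fs = 11025
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1000 * t)

S = spectrogram(x)
print(S.shape)   # (257, 85)
```

Displaying `S` as an image with a color map, for example on a decibel scale, gives a picture organized like Figure 1.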

Exercise 3

Why the specific values of 256 for $N$ and 512 for $K$? Another issue: how was the length-512 transform of each length-256 windowed frame computed?

Solution

These numbers are powers of two, so the FFT algorithm can be exploited with these lengths. To compute a longer transform than the input signal's duration, we simply zero-pad the signal.
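Zero-padding can be illustrated with a short NumPy sketch. The random frame below is only a stand-in for a windowed speech frame. The example checks that appending zeros by hand gives the same result as requesting a longer transform, and that the longer transform samples the same spectrum more densely.

```python
import numpy as np

rng = np.random.default_rng(0)
frame = rng.standard_normal(256)            # stand-in for a windowed frame

# Explicit zero-padding: append 256 zeros, then take a length-512 FFT.
padded = np.concatenate([frame, np.zeros(256)])
X_padded = np.fft.fft(padded)

# Equivalent shortcut: ask for a length-512 transform directly.
X_direct = np.fft.fft(frame, n=512)

print(np.allclose(X_padded, X_direct))                     # True
# Every other sample of the length-512 transform is the length-256 transform.
print(np.allclose(X_direct[::2], np.fft.fft(frame)))       # True
```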


This work is licensed by Don Johnson under a Creative Commons Attribution License (CC-BY 3.0), and is an Open Educational Resource.

Last edited by Don Johnson on May 17, 2013 10:00 am -0500.
