Seminars 2 and 3. EEG analysis

Seminars 2 and 3. EEG analysis#

Plan

Part 1.

Preparation and Basic Analysis

Load data (PhysioNet EEG Motor Movement/Imagery).
Channel selection & montage.
Spectral check & filtering.
Visualization of raw signals.
Event detection & epoching.
Quality control.
Evoked responses (ERP) (averaging, comparison of rest/left/right).

Part 2.

ICA and Artifact Removal

Run ICA (decompose into independent components).
Identify artifacts.
Exclude ICs and clean data (apply ICA, recompute ERP).

Global Measures and ERP Visualization

Global Field Power (GFP) to detect peaks.
ERP visualization (plot_joint, plot_topomap).

Functional Connectivity

Concept of coherence (spectral connectivity).
Compute coherence (rest vs left vs right).
Visualize networks (connectivity matrices, topomap graphs).
Beta band focus (12–20 Hz) on motor cortex (C3/Cz/C4).

Part 1#

All preprocessing and some data analysis of EEG data can be done using the Python library MNE.

Data: loading/reading PhysioNet EEG Motor Movement/Imagery. For demonstration, we use a single recording: S003R03.edf (+ .event).#

# For Colab only
# ! pip install mne
# ! pip install mne-connectivity

import warnings
warnings.filterwarnings('ignore')
import mne
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib
import seaborn as sns
import mne_connectivity

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
Input In [2], in <cell line: 3>()
      1 import warnings
      2 warnings.filterwarnings('ignore')
----> 3 import mne
      4 import numpy as np
      5 import pandas as pd

ModuleNotFoundError: No module named 'mne'

# not in colab
# %matplotlib notebook
# in colab
%matplotlib inline

mne.io includes the funtions for different EEG-record formats: https://mne.tools/stable/documentation/implementation.html#supported-data-formats

We will work with data for one patient from EEG Motor Movement/Imagery Dataset.

!wget "https://www.physionet.org/files/eegmmidb/1.0.0/S003/S003R03.edf"

--2025-09-02 03:44:39--  https://www.physionet.org/files/eegmmidb/1.0.0/S003/S003R03.edf
Распознаётся www.physionet.org (www.physionet.org)… 18.18.42.54
Подключение к www.physionet.org (www.physionet.org)|18.18.42.54|:443... соединение установлено.
HTTP-запрос отправлен. Ожидание ответа… 200 OK
Длина: 2596896 (2,5M) [application/octet-stream]
Сохранение в: «S003R03.edf»

S003R03.edf         100%[===================>]   2,48M   564KB/s    за 4,4s    

2025-09-02 03:44:44 (582 KB/s) - «S003R03.edf» сохранён [2596896/2596896]

!wget "https://www.physionet.org/files/eegmmidb/1.0.0/S003/S003R03.edf.event"

--2025-09-02 03:44:44--  https://www.physionet.org/files/eegmmidb/1.0.0/S003/S003R03.edf.event
Распознаётся www.physionet.org (www.physionet.org)… 18.18.42.54
Подключение к www.physionet.org (www.physionet.org)|18.18.42.54|:443... соединение установлено.
HTTP-запрос отправлен. Ожидание ответа… 200 OK
Длина: 638 [text/plain]
Сохранение в: «S003R03.edf.event»

S003R03.edf.event   100%[===================>]     638  --.-KB/s    за 0s      

2025-09-02 03:44:45 (26,5 MB/s) - «S003R03.edf.event» сохранён [638/638]

# !ls

sample = mne.io.read_raw_edf('S003R03.edf', verbose=False, preload=True)

Get some info about a record

sample.info

	General
	MNE object type	Info
	Measurement date	2009-08-12 at 16:15:00 UTC
	Participant	X
	Experimenter	Unknown
	Acquisition
	Sampling frequency	160.00 Hz
	Channels
	EEG
	Head & sensor digitization	Not available
	Filters
	Highpass	0.00 Hz
	Lowpass	80.00 Hz

print("Sampling freq:", sample.info["sfreq"], "Hz")

Sampling freq: 160.0 Hz

print("Duration (s):", len(sample) / sample.info["sfreq"])

Duration (s): 125.0

print("N channels:", len(sample.ch_names))

N channels: 64

Reflection: data exploration

How many channels and what is the sampling frequency? Why is it important to know this?
What modality and experimental paradigm are used?
What artifacts can be expected and why?

Channel selection and adding a montage#

sample.ch_names[:3]

['Fc5.', 'Fc3.', 'Fc1.']

Fixing trailing dots in channel names

In some EEG recordings, channel names contain trailing dots (e.g., Fc3.).
Fix this using the rename_channels method.

# create a mapping: old name → new name (without dot)
map = {}
for ch in sample.ch_names:
    map[ch] = ch.replace(".", "")

# apply renaming
sample.rename_channels(map)

# check the first channels
print(sample.ch_names[:10])

map = {}
for ch in sample.ch_names:
    map[ch] = ch.replace(".", "")
sample.rename_channels(map)

	General
	Filename(s)	S003R03.edf
	MNE object type	RawEDF
	Measurement date	2009-08-12 at 16:15:00 UTC
	Participant	X
	Experimenter	Unknown
	Acquisition
	Duration	00:02:05 (HH:MM:SS)
	Sampling frequency	160.00 Hz
	Time points	20,000
	Channels
	EEG
	Head & sensor digitization	Not available
	Filters
	Highpass	0.00 Hz
	Lowpass	80.00 Hz

sample.ch_names[:3]

['Fc5', 'Fc3', 'Fc1']

Why select exactly these 19 channels?

We restrict the analysis to the classic 19 electrodes of the 10–20 system.
This ensures comparability with textbooks and standard ERP protocols.

A1/A2 are not included → they are ear electrodes, not always present.
MNE uses an extended version of the 10–20 montage (closer to 10–10), so some labels differ:
- T3 = T7, T4 = T8
- T5 = P7, T6 = P8
Why this matters: when reading old papers (T3/T4 notation) or working with datasets using different conventions, you must map them consistently.
Takeaway: consistent channel selection = reproducible and interpretable analysis.

channels_to_use = [
    # prefrontal
    'Fp1',
    'Fp2',
    # frontal
    'F7',
    'F3',
    'F4',
    'Fz',
    'F8',
    # central and temporal
    'T7',
    'C3',
    'Cz',
    'C4',
    'T8',
    # parietal
    'P7',
    'P3',
    'Pz',
    'P4',
    'P8',
    # occipital
    'O1',
    'O2',
]

sample_1020 = sample.copy().pick(channels_to_use)

# check that everything is OK
assert len(channels_to_use) == len(sample_1020.ch_names)

ten_twenty_montage = mne.channels.make_standard_montage('standard_1020')
len(ten_twenty_montage.ch_names) #This command shows how many channels are embedded in the standard 10–20 montage we loaded. In MNE it is an extended version, so the number will be much greater than 19.

mne.channels.make_standard_montage(‘standard_1020’)#

Creates a montage object (electrode layout) for the standard 10–20 system.
It contains 3D coordinates (x, y, z) of each electrode on the scalp.
Essentially, it is a “map” of electrode positions.

sample_1020.set_montage(ten_twenty_montage)

	General
	Filename(s)	S003R03.edf
	MNE object type	RawEDF
	Measurement date	2009-08-12 at 16:15:00 UTC
	Participant	X
	Experimenter	Unknown
	Acquisition
	Duration	00:02:05 (HH:MM:SS)
	Sampling frequency	160.00 Hz
	Time points	20,000
	Channels
	EEG
	Head & sensor digitization	22 points
	Filters
	Highpass	0.00 Hz
	Lowpass	80.00 Hz

raw_1020.set_montage(mont)#

Assigns this map to the raw_1020 object.
Each EEG channel now has a known spatial position.
Required for topomaps, sensor plots, connectivity

sample_1020.plot_sensors(show_names=True)

../_images/3cc55b7bab0e95fa0ebf4f676dfa117e4c50376ed8f28b8e214d30008e47954a.png

Reflection: montage

Q1. Why is it important to standardize/normalize channel names before analysis?
Q2. Why do we need to set a montage? What problems arise for topomaps if we skip it?
Q3. Which of the selected 10–20 channels are located over the motor cortex?
Q4. How can you distinguish temporal from occipital channels in the 10–20 scheme?

Checking and Cleaning EEG Spectrum#

We first inspect the power spectrum (PSD) to detect line noise and drifts.

sample_1020.compute_psd().plot()

Effective window size : 12.800 (s)
Plotting power spectral density (dB=True).

../_images/b07edf98abcc83055a78aa1fa3cb364b674cb9062ba2e18b5ab9367065f13d29.png

1. Main signal energy is concentrated in low frequencies (<10 Hz)#

This reflects basic EEG physiology: in humans, most of the power lies in slow oscillations.

1–4 Hz (delta) — dominant during sleep, but also partly present as background during wakefulness.
4–8 Hz (theta) — associated with drowsiness and attention.
8–13 Hz (alpha) — typical for occipital areas, especially with eyes closed.

The fact that low frequencies dominate the spectrum confirms that this is real EEG, not just noise.

2. Weak 50/60-Hz peak#

In real recordings, a spike at the power line frequency (50 Hz in Europe, 60 Hz in North America) is often visible.

Here it is weak, which means:

either the equipment was well shielded,
or the artifact was attenuated by the filter (h_freq=50 (60)).

➡️ This is convenient: a separate notch filter is not required.

What The notch filter does: it is a targeted filter that removes a narrow band around a single frequency (e.g., exactly 50 or 60 Hz).
Why: the power grid operates at a fixed frequency (50/60 Hz), and this activity “leaks” into EEG recordings as a characteristic spike in the spectrum.
Problem: this activity is not brain-related but can mask real neural rhythms.
Solution: the notch filter selectively removes this noise while preserving neighboring frequencies (e.g., the alpha rhythm at 10 Hz or the beta rhythm at 20 Hz).

Band-pass filtering#

➡️ We apply a band-pass filter (1–50 Hz) to preserve only brain rhythms and remove drift and noise. For this, we use a digital IIR filter, which works efficiently and is applied in MNE in a safe zero-phase mode without time distortion.

sample_1020.filter(l_freq=1, h_freq=50, method='iir')

Filtering raw data in 1 contiguous segment
Setting up band-pass filter from 1 - 50 Hz

IIR filter parameters
---------------------
Butterworth bandpass zero-phase (two-pass forward and reverse) non-causal filter:
- Filter order 16 (effective, after forward-backward)
- Cutoffs at 1.00, 50.00 Hz: -6.02, -6.02 dB

	General
	Filename(s)	S003R03.edf
	MNE object type	RawEDF
	Measurement date	2009-08-12 at 16:15:00 UTC
	Participant	X
	Experimenter	Unknown
	Acquisition
	Duration	00:02:05 (HH:MM:SS)
	Sampling frequency	160.00 Hz
	Time points	20,000
	Channels
	EEG
	Head & sensor digitization	22 points
	Filters
	Highpass	1.00 Hz
	Lowpass	50.00 Hz

Ploting psd after filtering

sample_1020.compute_psd().plot()

sample_1020.compute_psd().plot()

Effective window size : 12.800 (s)
Plotting power spectral density (dB=True).

../_images/10985d9f7df87078ce560a020cf2e31ef389a68a575765e7c017bc78cdb65c11.png

Reflection: filtering

Q1. What is the difference between a notch and a band-pass filter? Why are both steps useful?
Q2. Why is it better to filter after setting the montage?
Q3. What does 1–50 Hz mean for EEG? Which physiological rhythms are preserved or removed?
Q4. Why is drift <1 Hz undesirable for ERP analysis?

Plot EEG signals#

We display several EEG channels on the screen for 20 seconds to visually check signal quality and detect artifacts that are not always visible in the spectrum. However, it is important to adjust the scalings parameter for proper visualization.

sample_1020.plot(n_channels=8, duration=20)

Using matplotlib as 2D backend.

../_images/bc6f60c8535bc25f64270e5b813322e28b462a82f9e4cd7488196b90267eddf4.png

# Plot in better scale. Use 'scalings' argument
sample_1020.plot(n_channels=8, duration=20, scalings=3e-4)

../_images/0afcea4470998fb0600bf043075fed061d7bdc2cac026ec5fb1d8e5ecdc95279.png

Event Detection: Triggers vs Annotations#

In EEG processing, correctly identifying events (e.g., stimulus onsets, task markers) is critical for epoching and downstream analysis. MNE-Python offers different tools depending on how these events are encoded in your file format:

Use mne.find_events when events are stored in a dedicated trigger channel (common in FIFF/BDF formats).
Use mne.events_from_annotations when events are embedded as annotations/markers (typical for EDF+ or BrainVision with annotations).

Why it matters for your data#

In our case, we’re working with EDF+ files (e.g., S003R03.edf). That means the events are stored as annotations, not trigger channels—so we should use mne.events_from_annotations() to properly extract them.

Next, we’ll load the events and create epochs around them for further analysis.

events, events_dict = mne.events_from_annotations(sample_1020)

Used Annotations descriptions: [np.str_('T0'), np.str_('T1'), np.str_('T2')]

events_dict

{np.str_('T0'): 1, np.str_('T1'): 2, np.str_('T2'): 3}

events[:5]

array([[   0,    0,    1],
       [ 672,    0,    3],
       [1328,    0,    1],
       [2000,    0,    2],
       [2656,    0,    1]])

Event array format in MNE:

[\text{sample_index},\ \text{previous_event_code},\ \text{current_event_code}]

sample_index – the time point (in samples) when the event occurs.
previous_event_code – the code of the event/state before the change (often 0 in EDF+).
current_event_code – the new event code that marks what actually happened.

👉 In EDF+ data, the middle column is usually 0 because events come from annotations, not from state changes.

Creating Epochs#

Once we have identified the events of interest in our continuous Raw data, the next logical step is to segment the continuous EEG into epochs — short, equal-duration chunks of signal that are time-locked to each event. This is crucial because averaging across many such epochs allows us to reveal consistent brain responses (ERPs) while reducing random noise.

In MNE, the Raw object together with the events array is the minimal input needed to create an Epochs object with the mne.Epochs class constructor. However, in practice we rarely use the defaults: we almost always want to specify the time window relative to each event, using the parameters tmin and tmax (start and end time of each epoch relative to the event). This ensures that our epochs capture both the pre-stimulus baseline and the post-stimulus response of interest.

We take 500 ms before the event as a baseline and 800 ms after the event to capture the main brain responses. This is a typical window for ERP analysis.

epochs = mne.Epochs(sample_1020, events,  tmin=-0.5, tmax=0.8, preload=True)

Not setting metadata
30 matching events found
Setting baseline interval to [-0.5, 0.0] s
Applying baseline correction (mode: mean)
0 projection items activated
Using data from preloaded Raw for 30 events and 209 original time points ...
1 bad epochs dropped

We convert the list of events into a table and count how many times each event type occurs.

pd.DataFrame(epochs.events, columns=['_', '__', 'event_id'])['event_id'].value_counts()

event_id
1    14
3     8
2     7
Name: count, dtype: int64

This is a quick way to check the class balance before creating Epochs.

# Take the first epoch from the Epochs object
for epoch in epochs:
    break
print(epoch.shape)  
# Output: (19, 209)
# -> 19 EEG channels × 209 time points
# Each epoch is stored as a 2D NumPy array (channels × samples)

# Compute the duration of a single epoch in seconds
epoch_duration = epoch.shape[1] / sample_1020.info['sfreq']
print(epoch_duration)  
# 209 samples / 160 Hz sampling rate ≈ 1.3 seconds
# This matches the chosen window: tmin = -0.5 s, tmax = 0.8 s

# Convert the raw object into a pandas DataFrame
print(sample_1020.to_data_frame().shape)  
# (20000, 20)
# -> 20,000 time points × 20 channels
# Each column corresponds to an EEG channel (plus 'time' if included)
# This format is convenient for quick inspection, plotting with pandas,
# or exporting EEG data to other analysis pipelines.

(19, 209)
1.30625
(20000, 20)

df = epochs.to_data_frame()
df.head(3).iloc[:, :10]

	time	condition	epoch	Fp1	Fp2	F7	F3	F4	Fz	F8
0	-0.50000	3	1	236.675891	244.275703	125.936913	88.829681	112.445086	73.438632	145.805904
1	-0.49375	3	1	175.970007	183.425016	103.803572	63.009365	82.687750	48.730906	115.749079
2	-0.48750	3	1	127.095181	132.985975	61.669835	31.478465	62.299189	33.384794	93.870855

What we learn:

We can see the raw numeric values (in µV) for each channel at each time point.
Easy to check that metadata (condition, epoch) are aligned correctly.

Peak-to-peak amplitude (QC)#

df[sample_1020.ch_names + ['epoch']].groupby('epoch').agg(lambda arr: arr.max() - arr.min()).hist(figsize=[10, 10]);
plt.tight_layout()

../_images/12345d189b9e237dd2e5194f1dfa39615d404d51d3496709f2e218d4258527bc.png

What we learn:

A quick QC step: typical amplitudes and outliers.
If Fp1/Fp2 show very large values → likely eye blinks or EOG.
Helps justify setting a rejection threshold (e.g., reject={‘eeg’: 600e-6}).

epochs = mne.Epochs(sample_1020, events,  tmin=-0.5, tmax=0.8, reject={'eeg': 600e-6}, preload=True, baseline=(-.1, 0))

Not setting metadata
30 matching events found
Applying baseline correction (mode: mean)
0 projection items activated
Using data from preloaded Raw for 30 events and 209 original time points ...
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp2']
    Rejecting  epoch based on EEG : ['Fp2']
    Rejecting  epoch based on EEG : ['Fp2']
    Rejecting  epoch based on EEG : ['Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
    Rejecting  epoch based on EEG : ['Fp1', 'Fp2']
15 bad epochs dropped

What we learn:

How many epochs are rejected and why.
Clean dataset for ERP analysis.

PSD of epochs#

Compute and plot the Power Spectral Density across epochs.

epochs.plot_psd()

NOTE: plot_psd() is a legacy function. New code should use .compute_psd().plot().
    Using multitaper spectrum estimation with 7 DPSS windows
Plotting power spectral density (dB=True).
Averaging across epochs before plotting...

../_images/7ad920ceb6b622fae538e74182330cc790150c302108f2c8755f5d57d274dfed.png

What we learn:

EEG shows the typical 1/f spectrum.
<1 Hz drift and >50 Hz noise are cut off.
No sharp spike at 50/60 Hz → line noise is not dominant.

Check that baseline correction and filtering worked properly

epochs.plot(n_channels=8, scalings={'eeg':3e-4})

../_images/ce835df874f6db472794d229d7ba7b2b64a923923c635bf387ecc5f455a30c3a.png

epochs.event_id

{'1': 1, '2': 2, '3': 3}

Checking number of events of each type (use epochs.events)

pd.DataFrame(epochs.events, columns=['_', '__', 'event_id'])['event_id'].value_counts()```

pd.DataFrame(epochs.events, columns=['_', '__', 'event_id'])['event_id'].value_counts()

event_id
3    6
1    5
2    4
Name: count, dtype: int64

Averaging (Evoked responses)#

evoked_T0 = epochs['1'].average()
evoked_T1 = epochs['2'].average()
evoked_T2 = epochs['3'].average()

evoked_T0.plot(spatial_colors=True)

../_images/0c0405bbea29cb843886694b1bc46d9f1881b513727fa274c94fb0f792a9f784.png

evoked_T1.plot(spatial_colors=True)

../_images/b99210c2c796a78e6979c4acf0428f5fb8e2a6eba8f3bb063ca5f2988a23e8f3.png

evoked_T2.plot(spatial_colors=True)

../_images/76c795be56ac0891cce2bd8d16ee65a639e821225ad85666bf82834f1f2aeb47.png

Compute condition-specific averages (ERPs).
Plot with spatial_colors=True so channels are colored by location.

What we learn:

Inspect ERP morphology and possible lateralization (C3/Cz/C4 for motor tasks).
Check N_avg in the top corner = how many epochs survived rejection.
If T1 vs T2 have very different N_avg, keep this in mind for statistical comparisons.

Part 2#

Independent Component Analysis (ICA) for Artifact Removal#

Independent Component Analysis (ICA) is a method for estimating independent sources from mixed signals.

In EEG, ICA is applied to identify and remove artifacts — signal components not generated by neural activity.

Common Types of EEG Artifacts#

Artifacts can be divided into physiological (originating from the human body but not the brain) and non-physiological/technical (external or recording-related):

Physiological artifacts
- Ocular activity (blinks, saccades)
- Muscle activity (EMG from face, jaw, or neck)
- Cardiac activity (ECG leakage)
- Perspiration (sweat-induced drifts)
- Respiration (movement-related fluctuations)
Non-physiological / Technical artifacts
- Electrode pops (sudden impedance changes)
- Cable movement (mechanical noise)
- Incorrect reference electrode placement
- AC line noise (50/60 Hz) and electromagnetic interference
- Gross body movements

Further resources:

ica = mne.preprocessing.ICA(n_components=10, random_state=42)

At this stage, no computation is performed yet — we are only creating an empty ICA object with chosen parameters.
Think of it as preparing a container where the following will be stored later:

Mixing matrix (A): how independent components (ICs) project back into EEG channels.
Unmixing matrix (W): how EEG channels can be decomposed into independent components.
IC time series and topographies after fitting.

ica.fit(sample_1020)

Fitting ICA to data using 19 channels (please be patient, this may take a while)
Selecting by number: 10 components
Fitting ICA took 0.2s.

Method	fastica
Fit parameters	algorithm=parallel fun=logcosh fun_args=None max_iter=1000
Fit	17 iterations on raw data (20000 samples)
ICA components	10
Available PCA components	19
Channel types	eeg
ICA components marked for exclusion	—

The actual unmixing of EEG channels into independent components happens only when we call:

ica.fit(sample_1020)

Explained variance ratio#

What it does: estimates how much of the EEG variability is explained by the selected ICs.
Why: a sanity check — to verify whether the chosen number of components is sufficient and whether important signal is preserved in reconstruction.
What to look for: values around ~0.95–0.99 for EEG are usually considered good.

explained_var_ratio = ica.get_explained_variance_ratio(sample_1020)
for channel_type, ratio in explained_var_ratio.items():
    print(f"Fraction of {channel_type} variance explained by all components: {ratio}")

Fraction of eeg variance explained by all components: 0.9738186007059333

Display the time courses of all independent components (IC000, IC001, …). The colored bars at the top correspond to experimental segments T0/T1/T2.

Why. To quickly spot repeating artifacts (blinks, EMG, ECG) and identify the intervals where they appear.

ica.plot_sources(sample_1020)

Creating RawArray with float64 data, n_channels=10, n_times=20000
    Range : 0 ... 19999 =      0.000 ...   124.994 secs
Ready.

../_images/bfa98ba97d206bf3301eca977b993056a66f13f3c08246d575b7b4fd202ab31e.png

What we learn:

What to look for.

Large sharp peaks repeating in bursts → typically eye blinks (EOG).
High-frequency “roughness” → muscle activity (EMG).
Regular ~1 Hz waves → ECG.
Stable, smooth traces with rhythmic patterns → usually neural ICs.

Typical conclusion. Select 1–3 “suspicious” ICs for detailed inspection in the next step.

Examine the scalp maps of IC contributions (mixing topographies)

ica.plot_components()

../_images/e47819e4af1c9f90787deb202595131310bb1ebbb26575132bf0c4d60e4c4ec1.png

../_images/82fec00da5a8d229fca6b9d174bc230ecf692614e54e4fa23020118ab6795a2b.png

What we learn:

Topography is the key to identifying the type of artifact:

Symmetry across the forehead (Fp1/Fp2) → EOG/blinks.
Localized patches near the temples/cheeks → EMG.
Dipole-like pattern closer to the center/lower head → possible ECG.
Occipital dominance with alpha activity → often a neural IC.

What to look for. Combine with the time-course analysis: suspicious temporal dynamics + a “matching” artifact topography = a strong candidate for exclusion.

Analyze IC007 in detail: topography, time course, PSD, and variance across segments.

Why. To distinguish neural components from artifact components using multiple criteria.

ica.plot_properties(sample_1020, picks=[7])

    Using multitaper spectrum estimation with 7 DPSS windows
Not setting metadata
62 matching events found
No baseline correction applied
0 projection items activated

../_images/eeaae85671ee9851feed3e78f23b5c54f25ff220844cbbf985141d784ac530a1.png

[<Figure size 700x600 with 6 Axes>]

What we learn:

What to look for.

Topography: is it physiologically plausible for brain/activity or for a specific artifact?
Spectrum (PSD): are there α (8–13 Hz) / β (13–30 Hz) peaks and an overall ~1/f falloff? A narrow spike at 50/60 Hz → line noise; a broad high-frequency floor → EMG.
Time series: any large, sharp transients (suggestive of EOG/blinks)?
Variance across segments: many outliers (instability → more likely an artifact).

Typical criterion. A “good” IC shows a 1/f decay, possible α/β peak, no large transients, and a plausible topography.

Temporarily zero out ICs with indices [0,1,2] and compare the signals before (red) and after (black).

Why. To make sure that removing the selected ICs reduces artifacts without distorting useful neural dynamics.

# blinks
ica.plot_overlay(sample_1020, exclude=[0,1,2])

Applying ICA to Raw instance
    Transforming to ICA space (10 components)
    Zeroing out 3 ICA components
    Projecting back using 19 PCA components

../_images/024585c921ee84fcbaa72424ef89ffb6f8e7a5a6500111c8bc9ccf7f20a5b95b.png

What we learn:

What to look for.

Blink/EMG peaks should be clearly reduced.
Physiological rhythms (e.g., alpha) should not disappear globally.

Important. This is only a preview — the raw data have not been modified yet.

Examine how excluding ICs [0,1] affects a specific channel (F3).

Why. To check the local effect in regions where artifacts are most noticeable (often frontal channels).

ica.plot_overlay(sample_1020, exclude=[0, 1], picks=['F3'])

Applying ICA to Raw instance
    Transforming to ICA space (10 components)
    Zeroing out 2 ICA components
    Projecting back using 19 PCA components

../_images/184e4fe22c7a9bc7464685c57534e20f710f91870211e585e433b7c141e4d19b.png

What we learn:

What to look for. Reduction of spikes/“flickering” in F3 while preserving the overall shape of the response.

ica.exclude = [0,1]

ICA000 (blue frontal component)

Topography: maximum strictly at the forehead, symmetric over Fp1/Fp2.
Time course (from your ica.plot_sources): rare, very large sharp peaks.
Spectrum: dominance of low frequencies, without regularity.

👉 This looks like EOG (blinks).

ICA001 (red “filled” component)

Topography: broader and deeper distribution, not only frontal but also extending downward/central.
Time course: peaks appear more regular, repeating roughly every ~1 second.
Spectrum: expected strong power around ~1–1.5 Hz.

👉 This is more consistent with ECG (heartbeat).

sample_1020_clr = sample_1020.copy()

ica.apply(sample_1020_clr)

Applying ICA to Raw instance
    Transforming to ICA space (10 components)
    Zeroing out 2 ICA components
    Projecting back using 19 PCA components

	General
	Filename(s)	S003R03.edf
	MNE object type	RawEDF
	Measurement date	2009-08-12 at 16:15:00 UTC
	Participant	X
	Experimenter	Unknown
	Acquisition
	Duration	00:02:05 (HH:MM:SS)
	Sampling frequency	160.00 Hz
	Time points	20,000
	Channels
	EEG
	Head & sensor digitization	22 points
	Filters
	Highpass	1.00 Hz
	Lowpass	50.00 Hz

# Apply the ICA to remove the chosen components from the data
cleaned_eeg = ica.apply(sample_1020_clr)

# Plot the cleaned signals after excluding the chosen artifacts
cleaned_eeg.plot(n_channels=8, duration=20, scalings=3e-4)

Applying ICA to Raw instance
    Transforming to ICA space (10 components)
    Zeroing out 2 ICA components
    Projecting back using 19 PCA components

../_images/f1db59bf6295c1c7683fddea4388ab0210c30d00940dd2f97eb9f0abf829b69f.png

epochs = mne.Epochs(sample_1020_clr, events,  tmin=-0.5, tmax=0.8, reject={'eeg': 600e-6}, preload=True, baseline=(-.1, 0))

Not setting metadata
30 matching events found
Applying baseline correction (mode: mean)
0 projection items activated
Using data from preloaded Raw for 30 events and 209 original time points ...
1 bad epochs dropped

evoked_T0 = epochs['1'].average()
evoked_T1 = epochs['2'].average()
evoked_T2 = epochs['3'].average()

evoked_T0.plot(spatial_colors=True)

../_images/bf02d69092f6f5eb5ec3d4118c76c5f3b8a1642094853841e0fcce88facbe871.png

evoked_T1.plot(spatial_colors=True)

../_images/f8aac711a96f928dc1e945fc4c1262875f78b7cc4109d65535b685e15d20f5d5.png

evoked_T2.plot(spatial_colors=True)

../_images/fb37edb52adbcb3515b40f49e78e66ef6bf851a4eba29cd4e07bc56e67bb74f6.png

Global Field Power (GFP)#

It is useful to look at Global Field Power (GFP).
GFP summarizes how much the EEG signals vary across all electrodes at each time point.

If all sensors record the same value, there is no spatial variability → GFP = 0.
If the sensors differ strongly, GFP increases → GFP > 0.

In practice, GFP peaks highlight moments of strong, widespread brain activity (for example, event-related potentials).
Therefore, GFP is often used as a quick overview to identify time periods that may be worth a closer look.

Let’s plot a comparison of averaged evoked responses (ERPs) across the three conditions:

rest (T0) — resting state,
left (T1) — imagination/movement of the left hand,
right (T2) — imagination/movement of the right hand.

mne.viz.plot_compare_evokeds(
    dict(rest=evoked_T0, left=evoked_T1, right=evoked_T2),
    legend="upper left",
    show_sensors="upper right",
)

combining channels using GFP (eeg channels)
combining channels using GFP (eeg channels)
combining channels using GFP (eeg channels)

../_images/fe9a8d52fda5a5bca112288841dfdfb3787249c6cc98d96fbf6b561072e89957.png

[<Figure size 800x600 with 2 Axes>]

Visualizing Evoked responses in more detail#

To explore evoked responses, MNE provides two useful plotting methods:

plot_joint
- Combines the ERP time course with topographic scalp maps.
- On the left: the ERP curve across time.
- On the right: scalp maps showing the spatial distribution of activity at specific time points.
- 👉 This helps to connect when something happens with where on the scalp it is strongest.
plot_topomap
- Shows only the scalp maps at selected time points.
- Useful if we want to focus on the spatial distribution of brain activity.

How to choose the time points#

We can set them manually, e.g. times=[0.1, 0.3, 0.5] (in seconds).
Or we can let MNE choose automatically with times='peaks'.
- In this case, MNE looks at the Global Field Power (GFP) curve.
- GFP measures how strong and synchronized the brain’s response is across the scalp.
- MNE finds the 3 strongest peaks in GFP.
- These peaks usually correspond to meaningful ERP components (like P100, N200, P300).

evoked_T0.plot_joint()
evoked_T0.plot_topomap(times=[0.0, 0.2, 0.3, 0.4, 0.8])

No projector specified for this dataset. Please consider the method self.add_proj.

../_images/c121282c7f80935d3bf75503cdf8f541c4e17b5a4195133cac3219b2332edf6d.png

../_images/dc1150154889ccf5aee7e252bfafbfd0efcb42beb224fa4b06c17d79c97493ad.png

../_images/e2e3d4d169529f6337f42ffc1c5ac5b040bd7196c1fb767700af855d4204c75c.png

evoked_T1.plot_joint()
evoked_T1.plot_topomap(times=[0.0, 0.2, 0.3, 0.4, 0.8])

No projector specified for this dataset. Please consider the method self.add_proj.

../_images/8cd0488818bfdc7e16c3a31544e36faf21a60035cb8b6e5dbd8b7097541b3216.png

../_images/5c8c9bb747e117ca9c37a5038426e7ecf1220fe4f811e7e8f66941f4c7aa6860.png

../_images/ad892c3e8892ab815a55e1d84ff3172a59215ed5028c7a6a5e9404ce5866f91c.png

evoked_T2.plot_joint()
evoked_T2.plot_topomap(times=[0.0, 0.2, 0.3, 0.4, 0.8])

No projector specified for this dataset. Please consider the method self.add_proj.

../_images/1ada9e9f7a5346863ad5427b214b5e48259744d4c9559d3438da393e016bae96.png

../_images/34ed89f50f84cc2f3b588547cd02ffc2bae67781c3208a13df117a8bd62dff80.png

../_images/bb5b9c114f7cc260547babcce2a4cce6a2b4c7297f1ecc3550f699d7daff5471.png

What we learn:

What to look for.

On the ERP time course: check where the peaks occur.
On the topomaps: see which brain regions (frontal, central, parietal, occipital) are most active at those times.

Functional Connectivity in EEG#

So far, we have mainly looked at local activity of the brain:

ERPs show the average response over time in each condition.
Topomaps reveal where on the scalp the strongest signals occur.
GFP summarizes overall synchrony across sensors.

But brain function is not only about local activations.
It also depends on interactions between different brain regions — how signals from one area are related to signals from another.
This is called functional connectivity.

Spectral Connectivity#

Connectivity is often estimated in the frequency domain:

We check if oscillations in two channels are synchronized at certain frequencies.
One popular measure is coherence (coh), which quantifies how consistently two signals oscillate together.
High coherence means the two brain regions are likely interacting or sharing information.

conn_T1 = mne_connectivity.spectral_connectivity_epochs(epochs['2'], method='coh')

Adding metadata with 3 columns
Connectivity computation...
only using indices for lower-triangular matrix
    computing connectivity for 171 connections
    using t=-0.500s..0.800s for estimation (209 points)
    frequencies: 4.6Hz..79.6Hz (99 points)
    Using multitaper spectrum estimation with 7 DPSS windows
    the following metrics will be computed: Coherence
    computing cross-spectral density for epoch 1
    computing cross-spectral density for epoch 2
    computing cross-spectral density for epoch 3
    computing cross-spectral density for epoch 4
    computing cross-spectral density for epoch 5
    computing cross-spectral density for epoch 6
    computing cross-spectral density for epoch 7
    assembling connectivity matrix
[Connectivity computation done]

What we learn:

What to look for.

Which pairs of channels show the highest coherence?
In which frequency bands (e.g., alpha, beta) is connectivity strongest?
How does connectivity differ between conditions (rest vs. left vs. right)?

This function transforms the connectivity matrix (NxN) into a graph on the scalp, making it easy to see which brain areas work synchronously.

def plot_topomap_connectivity_2d(info, con, picks=None, pairs=None, vmin=None, vmax=None, cm=None, show_values=False, show_names=True):
    """
    Plots connectivity-like data in 2d

    Drawing every pair of channels will likely make a mess
    There are two options to avoid it:
    - provide picks
    - provide specific pairs of channels to draw
    """

    # get positions
    _, pos, _, _, _, _, _ = mne.viz.topomap._prepare_topomap_plot(info, 'eeg');

#     if picks is None and pairs is None:
#         picks = info.ch_names

    ch_names_lower = [ch.lower() for ch in info.ch_names]
    if picks:
        picks_lower = [ch.lower() for ch in picks]
    if pairs:
        pairs_lower = [tuple(sorted([ch1.lower(), ch2.lower()])) for ch1, ch2 in pairs]

    rows = []
    for idx1, ch1 in enumerate(ch_names_lower):
        for idx2, ch2 in enumerate(ch_names_lower):
            if ch1 >= ch2:
                continue
            if picks and (ch1 not in picks_lower or ch2 not in picks_lower):
                    continue
            if pairs and (ch1, ch2) not in pairs_lower:
                    continue
            rows.append((
                pos[idx1],
                pos[idx2],
                con[idx1, idx2]
            ))

    if not len(rows):
        raise ValueError('No pairs to plot')

    con_to_plot = np.array([row[2] for row in rows])
    if vmin is None:
        vmin = np.percentile(con_to_plot, 2)
    if vmax is None:
        vmax = np.percentile(con_to_plot, 98)
    norm = matplotlib.colors.Normalize(vmin=vmin, vmax=vmax)

    if cm is None:
        cm = sns.diverging_palette(240, 10, as_cmap=True)

    fig, ax = plt.subplots(figsize=[5, 5])
    mne.viz.utils.plot_sensors(info, show_names=show_names, show=False, axes=ax);
    for row in rows:
        if (row[2] > vmin) and  (row[2] < vmax):
          rgba_color = cm(norm(row[2]))
          plt.plot([row[0][0], row[1][0]], [row[0][1], row[1][1]], color=rgba_color)
          if show_values:
              plt.text((row[0][0] + row[1][0]) / 2,
                      (row[0][1] + row[1][1]) / 2,
                      '{:.2f}'.format(row[2]))

What we can see in the figure:

Black dots → electrode positions (Fp1, Fp2, Cz, O1, etc.).
Lines between them → coherence values.
Red/thicker lines → stronger connections.
Blue/thinner lines → weaker connections.

Compute spectral connectivity (coherence) for each condition

conn_T0 = mne_connectivity.spectral_connectivity_epochs(epochs['1'], method='coh', verbose=False)
conn_T1 = mne_connectivity.spectral_connectivity_epochs(epochs['2'], method='coh', verbose=False)
conn_T2 = mne_connectivity.spectral_connectivity_epochs(epochs['3'], method='coh', verbose=False)

For rest (T0), left hand movement/imagery (T1), and right hand movement/imagery (T2), we compute coherence between all pairs of EEG channels.
Output: a 3D matrix → (channels × channels × frequencies).

Check the shape of the result

conn_T0.get_data(output="dense").shape

(19, 19, 99)

19 EEG channels × 19 EEG channels × 99 frequency bins.
This is the full coherence tensor across the scalp.

Average across frequencies (1–50 Hz)

conn_T0_all = conn_T0.get_data(output="dense")[:, :, 1:50].mean(axis=2)
conn_T0_all = conn_T0_all + conn_T0_all.T

conn_T1_all = conn_T1.get_data(output="dense")[:, :, 1:50].mean(axis=2)
conn_T1_all = conn_T1_all + conn_T1_all.T

conn_T2_all = conn_T2.get_data(output="dense")[:, :, 1:50].mean(axis=2)
conn_T2_all = conn_T2_all + conn_T2_all.T

Select frequencies from 1 to 50 Hz.
Average coherence values over this range → one mean connectivity matrix (19×19).
Add its transpose to make it symmetric (since coherence is theoretically symmetric).

The same procedure is applied for T1 and T2.

Visualize the connectivity on the scalp

plot_topomap_connectivity_2d(epochs.info, conn_T0_all, picks=epochs.ch_names, vmin=0.5)

../_images/4ee8a207daebdfbb79b912751cbd4571e9f68679550993533c5cdddf43ed75b6.png

Take the coherence matrix (e.g., for T0).
Plot electrodes as nodes (black dots) and coherence values as edges (lines).
Only strong connections above 0.5 are displayed:
- red/thicker lines = strong coherence,
- blue/thinner lines = weaker coherence.

What we learn:

What to look for.

When plotting connectivity for T1 (left hand) and T2 (right hand), adjust the parameter vmin.
vmin defines the minimum connectivity value shown on the color scale.
By increasing vmin, only the strongest connections are displayed, making it easier to see condition-specific differences.

How to interpret.

Compare the patterns of strong links (red edges) between T1 and T2.
Focus on whether the central electrodes (C3/C4) show stronger coherence with other areas depending on the imagined hand movement.

plot_topomap_connectivity_2d(epochs.info, conn_T1_all, picks=epochs.ch_names,  show_values=True, vmin=0.85)

../_images/fdd1b7f7d02f2461106a9011744543fc488fb67ef0ca9544eaf5b64e6a66074c.png

plot_topomap_connectivity_2d(epochs.info, conn_T2_all, picks=epochs.ch_names,  show_values=True, vmin=0.85)

../_images/d071a8b0fd7d1500cae6222e454c916a4e6f209c027e526b5933ce21b325cbe2.png

Beta-band coherence analysis

What we do.

Compute functional connectivity (coherence) in the beta band (12–20 Hz).
Focus only on selected pairs of channels instead of plotting all-to-all connections.
Use the helper function plot_topomap_connectivity_2d with the argument
pairs=[('F7', 'F4'), ('O2', 'T7'), ...].

Why.

Beta activity (12–20 Hz) is strongly linked to the motor cortex.
Coherence in this band reflects how well different brain regions are synchronized during motor tasks.
By restricting to 5–10 meaningful pairs, we highlight the most relevant motor and control connections without clutter.

What to look for.

Stronger coherence between C3/Cz/C4 and frontal/parietal regions → motor network engagement.
Differences between T1 (left hand) and T2 (right hand) conditions, reflecting lateralized motor control.
Red/thicker lines indicate strong synchronization; blue/thinner lines indicate weaker links.

Seminars 2 and 3. EEG analysis

Contents

Seminars 2 and 3. EEG analysis#

Part 1#

Data: loading/reading PhysioNet EEG Motor Movement/Imagery. For demonstration, we use a single recording: S003R03.edf (+ .event).#

Channel selection and adding a montage#

mne.channels.make_standard_montage(‘standard_1020’)#

raw_1020.set_montage(mont)#

Checking and Cleaning EEG Spectrum#

1. Main signal energy is concentrated in low frequencies (<10 Hz)#

2. Weak 50/60-Hz peak#

Band-pass filtering#

Plot EEG signals#

Event Detection: Triggers vs Annotations#

Why it matters for your data#

Creating Epochs#

Peak-to-peak amplitude (QC)#

PSD of epochs#

Averaging (Evoked responses)#

Part 2#

Independent Component Analysis (ICA) for Artifact Removal#

Common Types of EEG Artifacts#

Explained variance ratio#

Global Field Power (GFP)#

Visualizing Evoked responses in more detail#

How to choose the time points#

Functional Connectivity in EEG#

Spectral Connectivity#