Hi there 👋

I’m Fahim Anjum, a postdoctoral researcher at UCSF passionate about machine learning, particularly in brain signal processing and large language models (LLMs). I have 9+ years of experience in designing ML algorithms, statistical models, and efficient features for time-series analysis, forecasting, and classification. To learn more, please visit my projects and publications.


🔭 Developing deep-learning models for brain signal classification.
🌱 Exploring scalable architectures in LLMs for healthcare applications.
👯 Open to collaborations at the intersection of machine learning, neuroscience, and healthcare.
🤔 Actively seeking job opportunities in machine learning and brain signal processing.
💬 Ask me about machine learning, deep learning, or EEG signal processing.
📫 Reach me at dr.fahim.anjum@gmail.com
😄 Pronouns: Fahim
⚡ Fun fact: I’m fascinated by the parallels between neural networks and the human brain!


About Me

Education

I hold a Ph.D. in Electrical and Computer Engineering from the University of Iowa, where I specialized in decoding brain signals (EEG, LFP) in Parkinson’s disease and analyzing neural pathways using Control theory. My research led to innovative features for accurate brain decoding, with a patent pending on Parkinson’s diagnosis via EEG data. I was honored to receive the Graduate College Post-comprehensive Research Fellowship and was featured in the ‘Dare to Discover Campaign’ Dare to Discover Campaign for my work.

Currently, I’m a postdoctoral scholar at Little Lab, UCSF, where I focus on developing adaptive Deep Brain Stimulation to restore normal sleep patterns in Parkinson’s patients. In 2021, I received the Computational Innovator Fellowship Grant from UCSF.

Expertise

Brain signal decoding, Signal processing, Computational models, Machine learning, Control theory, LLM, Generative Models, Statistical models

Hobbies

Playing musical instruments, skiing, and hiking.


Projects

Generative Models

Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance
Developed an enhanced diffusion model by integrating alias-free resampling techniques inspired by StyleGAN3, resulting in improved rotational equivariance and stable training. The model demonstrates superior performance across MNIST, CIFAR-10, and MNIST-M datasets without introducing new trainable parameters, maintaining simplicity and efficiency in architecture.
Tech Stack: Pytorch

LiPCoT: Linear Predictive Coding based Tokenizer for Self-supervised Learning of Time Series Data via Language Models
Designed a novel tokenizer that converts time series data of brain signals into a fixed vocabulary via stochastic modeling and utilized BERT for pre-training and fine-tuning for detecting neurodegenerative diseases tasks.
Tech Stack: Pytorch

HugTokenCraft: Python library to modify BERT Tokenizer
Developed a Python library to simplify the process of vocabulary modification (add/remove tokens from dictionary) of a pre-trained tokenizer (BertTokenizer) used in BERT
Tech Stack: BERT

Efficient Machine Learning

Parkinson's Disease Classification via EEG: All You Need is a Single Convolutional Layer
Developed LightCNN, a single-layer CNN for Parkinson's disease classification using EEG data. LightCNN outperformed complex models in accuracy, precision, and recall while remaining lightweight and interpretable, making it ideal for deployment in resource-constrained settings, capturing clinically relevant neurophysiological patterns associated with PD.
Tech Stack: Pytorch, EEG data

Real-time Classification of Time Series
Developed supervised models using Linear Discriminant Analysis for binary classification of real-time incoming time series data of brain activity using frequency-domain features for adaptive brain stimulation.
Tech Stack: Python

Efficient Feature Engineering for Time Series Classifier
Developed manual features using signal processing (spectral power) from LFP brain activity time series (RCS brain-implant) for computationally efficient sleep stage classification and achieved over 90% accuracy with lightweight ML models (SVM), matching DL models (CNN, Transformer, RNN) with automatic feature engineering (published in Nature Communications)
Tech Stack: Matlab, R

Efficient Signal Processing

TurboLPC: Python library for Signal Processing
Developed an efficient Python library for stochas￾tic modeling of time series data for speech and audio analysis, data compression, and feature extraction. TurboLPC is 1,000x faster and comes with an advanced variation of frequency-warping for non-uniform frequency resolution
Tech Stack: Python

Time series Classification

Generalized Multi-Class Supervised Models for Time Series
Created multi-class supervised models for time series data of brain activity, achieving an average accuracy of 85%. Implemented extensive feature extraction, including 74 features (18 frequency-domain, 10 time-domain and additional coherence features). Evaluated 8 machine learning models, including ridge regression, SVM, KNN, decision tree, random forest, XGBoost, LightGBM, and artificial neural network.
Tech Stack: Python

Novel Feature Extraction Method for Time Series Classification
Designed a computationally efficient method to obtain features (via stochastic modeling) and ML algorithm for supervised classification of time series data improving both performance (+13% accuracy) and computational cost (5× faster).
Tech Stack: Matlab

Regression Model for Time Series Data
Designed ML algorithm that utilizes stochastic characteristics of time series data as features to estimate observed measurements.
Tech Stack: Matlab

Source Estimation from Time Series
Utilized single/multi-channel sensor time series data to estimate the characteristics of the source for localization and classification tasks.
Tech Stack: Matlab

Miscellaneous

Statistical Evaluation Framework
Developed complex statistical models and tests (Linear Mixed effects/t-test/MANOVA/Hierarchical Cluster Analysis) for time series data and features (via cross-correlation, spectral coherence, spectrogram, and wavelet analyses) to evaluate the stability of data-driven clustering and discover neurophysiological changes in brain networks.
Tech Stack: R

Automated Artifact Detection & Removal
Developed algorithms to detect and rectify abnormal data and noise artifacts in time series data of brain activities using signal processing tools (template matching, cross-correlation, Spectral power threshold, Kalman Filtering) and unsupervised ML clustering (Multivariate Auto-regressive HMM).
Tech Stack: Matlab

Data Synchronization among Wearable Devices
Designed signal processing algorithms for synchronizing time series data from multiple wearable and implanted devices while accounting for missing data, disconnections, mismatch of timestamps, and non-uniform sampling rates.
Tech Stack: Matlab

Data Processing Pipeline for Wearables
Developed a data collection and processing pipeline for wearable devices for building ML models and conducting statistical analyses for automatic data collection from servers (via RESTful APIs), data quality checks, resampling, artifact rejection, manual data corrections, and pre-processing (filtering).
Tech Stack: Matlab

Data Processing Tool for EEG
Developed pipeline for processing raw time series data (EEG) for building ML models and feature engineering.
Tech Stack: Matlab

Motion Segmentation using Subspace Clustering
Utilized ML algorithms (Mixture of Probabilistic Principal Component Analysis, k-Subspaces and Sparse Subspace clustering) to recognize moving objects in video.
Tech Stack: Matlab

Cloud-based AI-Assisted Education Platform
Developed a smartphone app (Android) and server-side web application for an AI-assisted education platform, supporting 10,000+ users. The app, deployed via Ruby on Rails with a complex PostgreSQL database and hosted using open-source PaaS (CapRover).
Tech Stack: Ruby, Rails, Postgres, Java, Docker


Latest News


📢 [August 2024] We published LiPCoT that enables NLP of time series via LLM.
📢 [Febuary 2024] Our paper was accepted in Nature Communications.
📢 [October 2021] Received Computational Innovator Postdoctoral Fellowship of USD 75,000.
📢 [September 2021] Joined Little Lab at UCSF as a postdoctoral scholar.
📢 [August 2021] Successfully defended my doctoral disseration!
📢 [January 2021] Graduate College Post-Comprehensive Research Fellowship of USD 10,000.


Awards & Funding


Postdoctoral Fellowship Grant
Md Fahim Anjum
Computational Innovator Postdoctoral Fellowship, $75,000 (2021-22)
UC Noyce Initiative, 2021
link

Graduate Research Fellowship
Md Fahim Anjum
Graduate College Post-Comprehensive Research Fellowship, $10,000 (2021)
Graduate College, The University of Iowa
link

Featured Researcher
Md Fahim Anjum
Dare to Discover Campaign (2021)
Office of the Vice President for Research, The University of Iowa
link


Media Coverage


A Personalized Brain Pacemaker for Parkinson’s
News Article
August 20, 2024
The New York Times
link

Taming Parkinson’s Disease with Intelligent Brain Pacemakers
News Article
August 19, 2024
Science Daily
link

Taming Parkinson’s Disease with Intelligent Brain Pacemakers
News Article
August 19, 2024
UCSF News
link

ECE research could enhance Parkinson’s disease treatment in rural settings
News Article
January 4, 2024
College of Engineering, The University of Iowa
link

University of Iowa researchers develop algorithm to detect Parkinson’s Disease
News Article
September 9, 2020
The Daily Iowan
link

UI engineers and neurologists develop a highly efficient algorithm that can detect Parkinson’s Disease through EEG data
News Article
August 31, 2020
College of Engineering, The University of Iowa
link


Patents


Apparatus, systems and methods for diagnosing parkinsons disease from electroencephalography data
Soura Dasgupta, Kumar Narayanan, Md Fahim Anjum, Raghuraman Mudumbai
United States Patent Application 17/020,432
link


Publications

For latest publications, please visit my Google Scholar profile.

Parkinson's Disease Classification via EEG: All You Need is a Single Convolutional Layer
Md Fahim Anjum
arXiv preprint arXiv:2408.10457, 2024
pdf | code

LiPCoT: Linear Predictive Coding based Tokenizer for Self-supervised Learning of Time Series Data via Language Models
Md Fahim Anjum
arXiv e-prints, arXiv:2408.07292, 2024
pdf | code

Generalized sleep decoding with basal ganglia signals in multiple movement disorders
Zixiao Yin, Huiling Yu, Tianshuo Yuan, Clay Smyth, Md Fahim Anjum, Guanyu Zhu, Ruoyu Ma, Yichen Xu, Qi An, Yifei Gan, Timon Merk, Guofan Qin, Hutao Xie, Ning Zhang, Chunxue Wang, Yin Jiang, Fangang Meng, Anchao Yang, Wolf-Julian Neumann, Philip Starr, Simon Little, Luming Li, Jianguo Zhang
npj Digital Medicine 7 (1), 122, 2024
pdf | code

Multi-night cortico-basal recordings reveal mechanisms of NREM slow-wave suppression and spontaneous awakenings in Parkinson’s disease
Md Fahim Anjum, Clay Smyth, Rafael Zuzuárregui, Derk Jan Dijk, Philip A. Starr, Timothy Denison, Simon Little
Nature Communications 15 (1793), 6, 2024
pdf

Resting-state EEG measures cognitive impairment in Parkinson’s disease
Md Fahim Anjum, Arturo I Espinoza, Rachel C Cole, Arun Singh, Patrick May, Ergun Y Uc, Soura Dasgupta, Nandakumar S Narayanan
npj Parkinson's Disease 10 (1), 6, 2024
pdf | code

Adaptive Deep Brain Stimulation for sleep stage targeting in Parkinson’s disease
Clay Smyth, Md Fahim Anjum, Shravanan Ravi, Timothy Denison, Philip Starr, Simon Little
Brain stimulation 16 (5), 1292-1296, 2023
pdf

Resting-state EEG Predicts Cognitive Impairment in Parkinson’s Disease (P6-11.015)
Ergun Uc, Fahim Anjum, Soura Dasgupta, Nandakumar Narayanan
Neurology 100 (17 Supplement 2), 2, 2023
pdf

Chronic motor cortical electrocorticography reveals negative correlation between slow wave and beta activity during deep NREM sleep in Parkinson’s disease
Md Fahim Anjum, Clay Smyth, Shravanan Ravi, Tanner Dixon, Philip Starr, Timothy Denison, Simon Little
Neuroscience 2022: Society for Neuroscience, Session 214, 2022
pdf

A pilot study of machine learning of resting-state EEG and depression in Parkinson’s disease
Arturo I. Espinoza, Patrick May, Md Fahim Anjum, Arun Singh, Rachel C. Cole, NicholasTrapp, Soura Dasgupta, Nandakumar S. Narayanan
Clinical Parkinsonism & Related Disorders 7, 2, 2022
pdf

Electroencephalography and Local Field Potentials in Parkinson's Disease: Models and Diagnosis
Md Fahim Anjum
The University of Iowa, 2021
pdf

Linear predictive coding distinguishes spectral EEG features of Parkinson's disease
Anjum Md Fahim, Dasgupta Soura, Mudumbai Raghuraman, Singh Arun, Cavanagh James F., Narayanan Nandakumar S.
Parkinsonism & Related Disorders, 107, 2020
pdf | code

Linear Predictive Approaches Separate Field Potentials in Animal Model of Parkinson's Disease
Anjum Md Fahim, Haug Joshua, L. Alberico Stephanie, Dasgupta Soura, Mudumbai Raghuraman, A. Kennedy Morgan, Narayanan Nandakumar S.
Front. Neurosci., 6, 2020
pdf

Unique Maximum Likelihood Localization of Nuclear Sources
BDO Anderson, S Dasgupta, HE Baidoo-Williams, MF Anjum, R Mudumbai
58th IEEE Conference on Decision and Control, 1, 2019
pdf

On the Uniqueness of Maximum Likelihood Estimation of Nuclear Sources
J Haug, MF Anjum, BDO Anderson, S Dasgupta, R Mudumbai, HE Baidoo-Williams
Proceedings of MTNS, 1, 2018
pdf

Optimal Non-coherent Data Detection for Massive SIMO Wireless Systems with General Constellations: A Polynomial Complexity Solution
Haider Ali Jasim Alshamary, Md Fahim Anjum, Tareq Al-Naffouri, Alam Zaib, Weiyu Xu
arXiv e-prints, arXiv: 1507.02319, 2015
pdf

Maximum-Likelihood Joint Channel Estimation and data Detection for 1-bit Massive MIMO
Md Fahim Anjum, Weiyu Xu, 2015
pdf