samsad35

Follow

🎯

Focusing

Samir Sadok samsad35

🎯

Focusing

Follow

Postdoctoral researcher

7 followers · 5 following

Achievements

Achievements

samsad35/README.md

Hi, I'm @samsad35 👋

Postdoctoral Researcher | AI, Speech & Audio Processing

🌐 Visit my portfolio

👨‍🔬 About Me

👋 Hi, I’m @samsad35!
🎓 Postdoctoral Researcher in AI, with research spanning several key axes: Interpretability, Generative Models, Self-Supervised Learning (SSL), and Multimodal Data (with a special focus on audiovisual speech data).
🔬 My current work explores:
- Self-Supervised Learning: Advancing representation learning for speech and audio, including the development and evaluation of neural audio codecs.
- Interpretability: Analyzing latent spaces, preventing representation collapse, and building tools to evaluate complex representation metrics.
- Generative Models: Exploring modern generative paradigms for high-quality audio and speech modeling.
- Multimodal Architectures: Integrating multiple data streams, such as audiovisual and articulatory data, to build more robust predictive models.
🚀 Experienced in running large-scale distributed training on high-performance computing clusters (SLURM, multi-GPU environments).

🛠️ Tech Stack & Tools

Languages: Python, HTML/CSS
Machine Learning / AI: PyTorch, Self-Supervised Learning, Generative Models
Audio & Signal Processing: Neural Codecs, audiomentations, SSL Metric Evaluation
Infrastructure: SLURM (High-Performance Computing), Linux Environments

📊 GitHub Stats

📫 Let's Connect

Personal Website / Portfolio: samsad35.github.io

Pinned Loading

source-filter-vae source-filter-vae Public

[SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder

Python 46 5
VQ-MAE-S-code VQ-MAE-S-code Public

[ICASSPW] A Vector Quantized Masked AutoEncoder for speech emotion recognition

Python 30 1
code-mdvae code-mdvae Public

[Neural Networks] A multimodal dynamical variational autoencoder for audiovisual speech representation learning

Python 5 2
code-ancogen code-ancogen Public

[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Python 14 1
VQ-MAE-AudioVisual-code VQ-MAE-AudioVisual-code Public

[CVIU] A vector quantized masked autoencoder for audiovisual speech emotion recognition

Python 4 1