Skip to content
View samsad35's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report samsad35

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
samsad35/README.md

Hi, I'm @samsad35 πŸ‘‹

Postdoctoral Researcher | AI, Speech & Audio Processing

🌐 Visit my portfolio

πŸ‘¨β€πŸ”¬ About Me

  • πŸ‘‹ Hi, I’m @samsad35!
  • πŸŽ“ Postdoctoral Researcher in AI, with research spanning several key axes: Interpretability, Generative Models, Self-Supervised Learning (SSL), and Multimodal Data (with a special focus on audiovisual speech data).
  • πŸ”¬ My current work explores:
    • Self-Supervised Learning: Advancing representation learning for speech and audio, including the development and evaluation of neural audio codecs.
    • Interpretability: Analyzing latent spaces, preventing representation collapse, and building tools to evaluate complex representation metrics.
    • Generative Models: Exploring modern generative paradigms for high-quality audio and speech modeling.
    • Multimodal Architectures: Integrating multiple data streams, such as audiovisual and articulatory data, to build more robust predictive models.
  • πŸš€ Experienced in running large-scale distributed training on high-performance computing clusters (SLURM, multi-GPU environments).

πŸ› οΈ Tech Stack & Tools

  • Languages: Python, HTML/CSS
  • Machine Learning / AI: PyTorch, Self-Supervised Learning, Generative Models
  • Audio & Signal Processing: Neural Codecs, audiomentations, SSL Metric Evaluation
  • Infrastructure: SLURM (High-Performance Computing), Linux Environments

πŸ“Š GitHub Stats

samsad35's GitHub stats

Top Languages

πŸ“« Let's Connect

Pinned Loading

  1. source-filter-vae source-filter-vae Public

    [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder

    Python 46 5

  2. VQ-MAE-S-code VQ-MAE-S-code Public

    [ICASSPW] A Vector Quantized Masked AutoEncoder for speech emotion recognition

    Python 30 1

  3. code-mdvae code-mdvae Public

    [Neural Networks] A multimodal dynamical variational autoencoder for audiovisual speech representation learning

    Python 5 2

  4. code-ancogen code-ancogen Public

    [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

    Python 14 1

  5. VQ-MAE-AudioVisual-code VQ-MAE-AudioVisual-code Public

    [CVIU] A vector quantized masked autoencoder for audiovisual speech emotion recognition

    Python 4 1