Skip to content
View yesabhishek's full-sized avatar
🏊
swimming in data lake :)
🏊
swimming in data lake :)

Block or report yesabhishek

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yesabhishek/README.md

Abhishek Choudhury

Senior Software Engineer specializing in distributed systems, AI-driven platforms, and data engineering. Building scalable solutions that solve real-world problems at enterprise scale.

About

I architect and build high-performance distributed systems with a focus on AI/ML integration, data pipelines, and cloud-native architectures. With over 5 years of experience shipping production-grade software for Fortune 100 companies and startups alike, I've led engineering teams and delivered platforms processing millions of transactions daily.

Currently working on healthcare SaaS platforms and agentic AI systems at ValueLabs, where I lead backend engineering initiatives for clients including Pfizer, Eli Lilly, Novartis and others.

Technical Expertise

Languages & Frameworks
Python, SQL | Django, FastAPI, Flask, PyTorch, PySpark, LangChain, LangGraph

Data & Cloud Infrastructure
PostgreSQL, MongoDB, Cassandra, Redis, Snowflake, BigQuery, DuckDB | AWS, GCP, Docker, Kubernetes, Kafka, Airflow, DBT

AI/ML & Specializations
Generative AI, RAG Systems, Machine Learning, Vector Databases (PgVector, FAISS, Weaviate), OpenAI, Ollama

Notable Projects

Agentic AI Orchestration Platform

Engineered an autonomous AI system using LangChain and LangGraph that automates market trend analysis, sentiment analysis, and GTM strategy generation for a Fortune 100 client. Improved forecast accuracy by 40% through intelligent agent coordination and retrieval-augmented generation.

Pyaw - Customer Service RAG Platform

Built an enterprise RAG-based customer service platform achieving 95% query resolution accuracy. Reduced support ticket volume by 40% by enabling natural language interactions with knowledge bases. Deployed as multi-tenant SaaS (iQ Suite) with integrated billing and user management.

Text2SQL Intelligence

Developed and fine-tuned a production-grade Text2SQL model using PyTorch and Hugging Face that converts natural language to SQL queries with fine-grained access controls. Outperformed existing market solutions by 30% in accuracy and query complexity handling.

Healthcare Data Pipeline Architecture

Designed and implemented data pipelines using Python, Snowflake, and Apache Airflow following Medallion architecture principles. Improved data delivery efficiency by 57% while maintaining data lineage tracking and governance standards for clinical trial operations.

High-Volume Fintech Platform

Led development of B2B healthcare/fintech platform handling 100K+ daily transactions for 1M+ users. Architected real-time data synchronization using Django, RabbitMQ, and Celery achieving 98% data accuracy. Implemented OCR/NLP pipeline for insurance policy summaries with 80% accuracy in under 5 seconds.

Professional Experience

Senior Software Development Engineer at ValueLabs
Leading backend engineering team and delivering AI-driven healthcare solutions for pharmaceutical giants.

Senior Software Development Engineer at Blue Hex Software
Built RAG platforms and AI/ML solutions for global clients including Intel, Nokia, Maersk, and Royal Enfield.

Software Development Engineer at Bima Garage
Architected fintech infrastructure handling millions of users and transactions with real-time processing capabilities.

Certifications

  • Snowflake Associate Certification (2025-2027)
  • AWS Cloud Practitioner (2023-2026)

Connect


I believe in building software that scales technically and creates measurable business impact. Currently exploring opportunities at the intersection of AI, data engineering, and distributed systems. Open to collaborating on interesting problems.

Pinned Loading

  1. pastebin-cli pastebin-cli Public

    GitHub-backed personal pastebin CLI with autosave, local cache, and sync.

    Go 1

  2. ada ada Public

    Semantic sidecar for Git repositories

    Go

  3. sqs-explorer sqs-explorer Public

    A modern, privacy-focused web interface for visualizing and exploring AWS SQS queues

    HTML 1

  4. endurance endurance Public

    Local-first private finance workspace for expenses, investments, Gmail sync, Kite MCP, and read-only AI analysis

    Go

  5. openrag openrag Public

    Forked from langflow-ai/openrag

    OpenRAG is a comprehensive, single package Retrieval-Augmented Generation platform built on Langflow, Docling, and Opensearch.

    Python