Skip to content
View Sathvik2954's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Sathvik2954

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sathvik2954/README.md

AI/ML Engineer · Full Stack Developer · Problem Solver

B.Tech Artificial Intelligence & Machine Learning · CBIT Hyderabad · CGPA 9.12

Gmail LinkedIn Portfolio LeetCode


About Me

I am a sociable and impact-driven developer with experience delivering ML and full-stack solutions through internships, research, and collaborative projects. I enjoy translating real-world problems into practical technology — and thrive in collaborative environments where taking initiative matters.

Currently pursuing B.Tech in AI/ML at CBIT Hyderabad (2023–2027).


What I Build

Domain Focus
Software Engineering Offline-first systems, real-time sync, conflict resolution, WebRTC, role-based auth
Full Stack Development React, Next.js, Node.js, FastAPI, PostgreSQL, MongoDB
AI Engineering Agentic AI, RAG pipelines, tool-calling agents, LLM integration, vector databases
Machine Learning Computer vision, NLP, multimodal learning, transformer fine-tuning
Data Science & Analytics Predictive modeling, SQL analytics, business dashboards, ROI optimization

SDE & Full Stack Projects

CONTINUUM

React TypeScript Node.js Express MongoDB Dexie.js WebRTC Socket.io PWA i18next

Offline-first healthcare continuity platform connecting patients and doctors through a single portable health record that works with or without internet. Patients record symptoms by voice, track vitals, upload documents, and consult doctors asynchronously — all data queued locally in IndexedDB and synced automatically on reconnect. Includes scheduled WebRTC video calls with call audio saved to the patient timeline, conflict resolution via HTTP 409 on stale writes, and multilingual support in English, Hindi, and Telugu.

Roles: Patient · Doctor · Admin

Live


SETHU

Next.js 14 TypeScript Supabase PostgreSQL RLS FastAPI Mistral API RapidOCR pdf-lib pg_cron

Role-based campus management platform for CBIT Hyderabad serving students, faculty, HODs, and administrators. Access control is enforced at the database layer through Row Level Security — every query auto-filters to what that role is permitted to see. Timetables can be imported from PDFs via OCR; the AI study planner (Mistral) reads subject annotations and the day's schedule to return a ranked priority list. Requests auto-generate approval PDFs, and pg_cron handles exam reminders and deadline alerts.

Roles: Student · Faculty · HOD · Admin

Live


Syncpad

React 19 Liveblocks Yjs Monaco Editor LiveKit Node.js MongoDB Clerk Judge0

Real-time collaborative coding interview platform built in 24 hours at CBIT Hacktoberfest 2025 — Special Mention among ~500 teams. Features multi-user CRDT-based code editing via Liveblocks and Yjs, audio/video conferencing via LiveKit WebRTC, code execution through Judge0, session replay for post-interview review, and a gamified daily quiz system. Resume analysis powered by Mistral AI extracts skills and generates candidate summaries.

Contributions: Quiz module, gamified daily quiz, Resume Analyzer, frontend UI and routing


AI / ML Projects

PathVQA — Multimodal Visual Question Answering

PyTorch EfficientNet-B0 ResNet50 Faster R-CNN BiLSTM GRU BAN Stacked Attention

Research project designing and benchmarking three multimodal VQA architectures on the PathVQA dataset (32,799 QA pairs, 4,998 pathology images). Implemented region-based visual reasoning with Faster R-CNN + GRU + BAN, global CNN encoding with EfficientNet-B0 + BiLSTM + Bilinear Fusion, and iterative attention with ResNet50 + Stacked Attention. Best result: 60.39% overall exact match and 53.45% open-ended EM with EfficientNet-B0 + Bilinear Fusion. Key finding: fusion strategy has more impact than backbone complexity on pathology images.


Zenvia — Fashion Intelligence Platform

Flask PyTorch ResNet-50 MediaPipe Pose OpenCV SerpAPI Mistral API Cloudinary

Full-stack AI platform with five integrated modules: real-time body size estimation via MediaPipe Pose, seasonal color classification using a 5-fold ResNet-50 ensemble (94.18% validation accuracy), live product discovery via SerpAPI Google Shopping, a virtual wardrobe with outfit scheduling and weather-based suggestions, and a conversational FashionBot powered by Mistral. Presented at ICAIATI-2025, paper under publication.


Hindi News Classification System

IndicBERTv2 Flask PyTorch HuggingFace Transformers EasyOCR BeautifulSoup Docker

End-to-end Hindi NLP platform classifying news headlines into five categories through three input modes: typed text, live scraping from Amar Ujala, Dainik Jagran, Navbharat Times, and BBC Hindi, and OCR extraction from images and PDFs. Fine-tuned IndicBERTv2 outperforms mBERT (73.98%) and XLM-RoBERTa (78.06%) with 79.57% accuracy due to its Indic-specific pretraining. Deployed on HuggingFace Spaces via Docker.

Live


Data Science & Analytics Projects

Telecom Churn Analysis

Python Scikit-learn RandomForest Pandas Matplotlib Seaborn

Business-optimized churn prediction on 7,043 telecom customers. Instead of maximizing accuracy, the decision threshold was tuned from 0.5 to 0.45 to prioritize recall — because a missed churning customer (₹2,000 LTV loss) costs four times more than a wasted retention offer (₹500). RandomForest achieved 82.4% recall and 0.822 ROC-AUC, identifying 861 at-risk customers and projecting ₹161,500 net business value. Top churn drivers: tenure under 6 months (53.3% churn rate) and month-to-month contracts (42.7%).


Olist E-Commerce Analytics

MySQL 8.0 Python pandas SQLAlchemy Power BI DAX

End-to-end analytics pipeline on 100,000+ Brazilian e-commerce orders from the Olist marketplace. Answered six business questions through 10 SQL queries covering revenue by category, delivery performance by state, late delivery impact on review scores, repeat purchase rate, payment method breakdown, and month-over-month revenue growth. Findings delivered through a three-page executive Power BI dashboard. Key finding: deliveries delayed 4+ days average a 1.86/5.0 review score versus 4.29/5.0 for early deliveries.

Dataset


Customer Reviews Topic Modeling

Gensim BERTopic Scikit-learn sentence-transformers UMAP HDBSCAN spaCy NLTK

Unsupervised topic discovery across 630,000+ app reviews from 11 e-commerce platforms including Amazon, Flipkart, Myntra, and Meesho. Five methods benchmarked — LDA, NMF, LSA, BERTopic, LDA+Bigrams — with results aggregated into a consensus topic set via cosine similarity of topic-word vectors. NMF ranked first on both coherence (0.5739) and diversity (0.8333) across all 11 individual apps. All 10 consensus topics confirmed HIGH CONFIDENCE, covering delivery, refunds, customer service, app bugs, and pricing.

Dataset


Tech Stack

Languages

Python JavaScript TypeScript SQL

Frontend

React Next.js Tailwind CSS Vite

Backend

Node.js Express FastAPI Flask

Databases & Infra

MongoDB PostgreSQL MySQL Supabase AWS

AI / ML

PyTorch TensorFlow Scikit-learn HuggingFace OpenCV

Tools

Git Linux Postman Power BI


Certifications & Achievements

AWS Certified Cloud Practitioner Amazon Web Services
Salesforce Certified Agentforce Specialist Salesforce · 2025
Certificate of Proficiency in AI/ML IIIT Hyderabad — iHub-Data · May–Sep 2025
Special Mention · Hacktoberfest 2025 ~500 teams · product completeness & innovation
Research Publication ICAIATI-2025 · Zenvia · Under publication
Vice President · Robotics & Innovation Club CBIT · 2024–2026

Connect With Me

"Bide your Time. Hide your Strength."

Gmail LinkedIn Portfolio

"First, solve the problem. Then, write the code." — John Johnson

⭐ If you find my work useful, drop a star! ⭐

Pinned Loading

  1. Portfolio Portfolio Public

    Personal portfolio built with vanilla HTML, CSS, and JavaScript — no frameworks.

    HTML 1

  2. Sethu Sethu Public

    SETHU - Role based campus management platform for CBIT with department-scoped timetables, AI-assisted study planning, request approvals, and automated academic notifications. Built with Next.js, Su…

    TypeScript 1

  3. customer-reviews-topic-modeling customer-reviews-topic-modeling Public

    Multi-method topic modeling pipeline on 630K customer reviews from 11 e-commerce platforms.

    Jupyter Notebook 1