sunshine-JLU

Looking for LLM/VLM opportunities

6 followers · 14 following

Hong Kong University
Hong Kong

sunshine-JLU/README.md

Hi, I'm Runheng Cai 👋

AI/ML learner focusing on Large Language Models (LLMs).

🚀 Current Focus

LLM Alignment: RL & MLLM

🎓 Education

Master@HKU Computer Science, The University of Hong Kong (HKU)

Bachelor@JLU Information Engineering, Jilin University (JLU)

💌 Contact Me

Email: u3619679@connect.hku.hk
WeChat: crh_8318

Pinned

cis_grpo Public

CIS-GRPO: Contrastive Image Sampling for GRPO training of VLMs

Python 1

deepseek-janus-pro-lora Public

This project demonstrates how to fine-tune DeepSeek Janus-Pro with LoRA.

Python 41 5

AnyLLM-to-VLM Public

Turn any text-only LLM into a Vision-Language Model through efficient training.

Python 24 2

deepseek-r1-distill-llama-8b-lora Public

This project demonstrates how to fine-tune deepseek-r1-distill-llama-8b.

Jupyter Notebook 17 5

Who_is_the_undercover_agent_LLM Public

This project has different large language models play the game "Who Is the Undercover Agent" against each other.

HTML 1

deepseek-r1-distill-qwen-7B-lora Public

This project demonstrates how to fine-tune deepseek-r1-distill-qwen-7B.

Jupyter Notebook 3