PhD in Computer Science · Data Scientist & AI Engineer · NLP Specialist
Aix-en-Provence, France · Open to new opportunities
Data Scientist and AI Engineer with a PhD in Computer Science (Aix-Marseille Université, 2022), specialising in NLP, LLMs, RAG and generative AI. I combine published ML research with hands-on product engineering.
- 🎓 PhD in Computer Science — Aix-Marseille Université, LIS / IrAsia — ERC Advanced Grant ENP-China (n° 788476)
- 📄 7 peer-reviewed publications — LREC-COLING, NLP4DH, TALN, PACLIC, JDMDH, JHNR
- 🌍 Presented research in 7 countries (France, USA, China, Italy, Japan, India, Vietnam)
- 🔭 Currently building a Data & AI SaaS platform — RAG pipeline, text-to-SQL, multi-tenant architecture
Languages
ML / AI
LLM & Generative AI
litellm · LangGraph · Langfuse · RAG / GraphRAG / RAPTOR · Hybrid Search (BM25 + pgvector) · Reranking · Text-to-SQL · Prompt Engineering · VLM
Full-Stack & Infrastructure
| Project | Description | |
|---|---|---|
| HistText | Full-stack platform for large-scale analysis of historical Chinese texts (billions of tokens). Rust backend, React UI, Apache Solr, multilingual NER pipeline, R package on CRAN. Deployed for the international digital humanities community. | 🌐 Live |
| EventExtractionPapers | Curated and actively maintained list of NLP papers, datasets and models for event extraction. Widely used reference in the research community. | ⭐ 580 |




