Resumé
💼 Experience
TopCV Vietnam Joint Stock Company
Data Science Manager (10/2020 - 10/2025)
- Built and scaled the Data Science team from scratch to 18 members (currently 13), establishing hiring processes, team structure, operating cadence, and technical roadmap
- Led development of AI-powered features for Vietnam’s largest career platform, serving millions of users: CV Parser, Job Recommendation, Candidate Recommendation, JD-Writing Assistant, Automated Job-Post Moderation, …
- Established a company-wide data culture in which strategic business and product decisions are consistently data-driven
- Designed and operated a data platform across two environments (cloud, 2020-2024; on-premise, 2025), with continuous improvements in reliability, performance, and cost efficiency
- Mentored and upskilled the team, fostered a high-performance working culture, and led company-wide Data & AI Literacy programs
AI Engineer (01/2020 - 09/2020)
- Launched the CV Parser initiative, defining annotation standards, building the data-labeling pipeline, and developing extraction models
- Partnered with domain stakeholders to map business workflows and identify high-impact AI use cases across product and operations
YBox Menteelogy
Mentor (03/2023 - 08/2024)
- Shared practical Data Science knowledge and industry experience via 1:1 sessions for 17 mentees
- Reviewed resumes, helped mentees build recruiter-ready portfolios, and prepared them for interviews
Finsify Limited Company
Data Scientist/Lead of AI Team (09/2017 - 08/2018)
- Led the AI team in developing financial technology solutions using machine learning
- Built ML features for a personal financial management app: bank transaction classification and CAPTCHA recognition
- Designed and implemented data pipelines for real-time processing
VNG Corporation
Research Scientist (08/2016 - 04/2017)
- Prototyped and proposed a real-time fraud detection system for ZaloPay
- Built the NLU module for the 123Xe dialog system: Vietnamese intent classification and slot filling
RichAnchor Technology
Machine Learning Engineer (03/2016 - 08/2016)
- Developed ML models for the content recommendation system of an online bookstore: document classification and entity extraction
- Implemented end-to-end ML pipelines from data collection to model deployment
🏫 Education
City University of Hong Kong
Ph.D. Student in Computer Science (09/2018 - 12/2019)
Research topic: Multi-modal dialogue system
Supervisor: Prof. Chong-Wah Ngo
Status: Left the program to pursue an industry career
FPT University
Bachelor’s Degree in Computer Science (09/2011 - 12/2015)
Capstone Project: Building a Semantic Role Labelling Toolkit for Vietnamese
GPA: 8.63/10
📝 Publications
Building a Spoonerism Detection System for Vietnamese
Thai-Hoang Pham, Xuan-Khoai Pham
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, Hong Kong, ACL Anthology, 2018
NNVLP: A Neural Network-Based Vietnamese Language Processing Toolkit
Thai-Hoang Pham, Xuan-Khoai Pham, Tuan-Anh Nguyen, Phuong Le-Hong
Proceedings of the 8th International Joint Conference on Natural Language Processing, Taipei, Taiwan, ACL Anthology, 2017
On the Use of Machine Translation-Based Approaches for Vietnamese Diacritic Restoration
Thai-Hoang Pham, Xuan-Khoai Pham, Phuong Le-Hong
Proceedings of the 21st International Conference on Asian Language Processing, Singapore, IEEE, 2017
Building a semantic role labelling system for Vietnamese
Thai-Hoang Pham, Xuan-Khoai Pham, Phuong Le-Hong
Proceedings of the 10th International Conference on Digital Information Management, Jeju Islands, South Korea, IEEE, 2015
🛠️ Technical Skills
Programming & Languages
- Languages: Python, SQL
- ML/DL Frameworks: PyTorch, Scikit-learn, XGBoost, flair-nlp, LangChain/LangGraph
- Data Engineering: Apache Spark, Airflow, ETL Pipelines, Data Warehousing
- Cloud & Infrastructure: GCP, Docker, Kubernetes, MLflow
Domain Expertise
- Machine Learning: Supervised/Unsupervised Learning, Deep Learning, Ensemble Methods
- Natural Language Processing: Vietnamese NLP, Text Classification, Named Entity Recognition, Semantic Analysis
- Data Science: Statistical Analysis, A/B Testing, Experimentation, Data Visualization
- Leadership: Team Building, Technical Mentoring, Cross-functional Collaboration, Strategic Planning
Tools & Platforms
- Development: Git, Jupyter, VS Code, CI/CD
- Data Tools: Pandas, NumPy, Matplotlib, Seaborn, Plotly
- ML Ops: Model Deployment, Monitoring, Versioning, Production Systems
🏅 Achievements
Third Prize (Rank 10), ACM-ICPC Asian Regional Contest (Vietnam site) - 2015
Honorable Mention (Rank 21), ACM-ICPC Asian Regional Contest (Thailand site) - 2015
Second Prize, Informatics Olympic Contest for Students, Division A - 2014
Third Prize, National Contest for Gifted Students, Subject: Physics - 2011