- Designed and built scalable backend schemas and infrastructure, optimizing data storage and retrieval for real-time client applications.
- Implemented Retrieval-Augmented Generation (RAG) with AWS Bedrock to improve document processing and query answering on complex insurance data. Used AWS SageMaker to design, train, and deploy machine learning models predicted to help clients avoid 30% of adverse events.
- Built a robust, responsive frontend platform using Next.js, providing user-friendly interfaces for clients and insurers.
- Integrated AWS Cognito for secure, seamless authentication and authorization, enabling distinct dashboards tailored to clients and insurers.
- Conceptualized and standardized the company's design theme and colour scheme, ensuring brand consistency and an intuitive user experience. Boosted usability by implementing cohesive design layouts across the platform, improving navigation and engagement.
- Developed and deployed intelligent agents powered by large language models, making the backend more flexible and responsive, enabling dynamic data processing, and improving system performance and scalability.
work experience
Data Scientist Intern
@ Parametriks
August 2024 - Present
Paris, France
Data Science
Data Analytics
Machine Learning
Research Intern
@ Singapore Immunology Network (SIgN)
May 2024 - August 2024
Singapore
Project: Advance analytics capabilities for cancer vaccine research by developing a real-time pipeline for TCR repertoire analysis of Nanopore sequencing data.
- Background Information: Long-read sequencing technologies, such as Nanopore, are prone to errors and require sophisticated data cleaning methods for accurate sequence recovery. Addressing this challenge is crucial for advancements in bioinformatics and cancer research.
- Final Pipeline
  - Analysed large-scale genomic datasets exceeding 7 million rows, producing insights that shifted the research focus.
  - Integrated over 11 open-source bioinformatics tools into a custom automated pipeline, streamlining the analysis.
  - Developed Shell and Python automation scripts to extract and group cell barcodes, segment TCR α and β chains per cell, and reconstruct these chains using de novo assembly. This increased barcode recovery from 608,700 to 2,181,878, roughly a 3.6-fold improvement.
  - Implemented a Python-based method for categorizing sequences into TCR α and β chains without needing a whitelist. The method used Ward's linkage clustering and achieved an average accuracy of 90% on sequences from 100 unique cell barcodes, demonstrating its effectiveness for clonotype identification in long-read sequencing.
  - Proposed a novel approach for separating TCR chains that outperformed traditional methods, aiding the clonotype identification that is crucial for understanding immune responses in cancer vaccine research.
  - Customized Shasta for the assembly of TCR α and β sequences, addressing the lack of pre-existing configurations for TCR-specific assembly.
- Additional Achievements
  - Devised and validated a method that separates TCR α and β chains without reference mapping, making it robust across diverse data contexts.
  - Iterated on the pipeline multiple times, experimenting with over 20 bioinformatics tools and methodologies. This work led to major research shifts based on in-depth data analysis.
  - Co-authored the paper "Refining TCR Clonotype Identification With Long-Read Sequencing Technique", submitted to the Society for Immunotherapy of Cancer (SITC), contributing to advancements in cancer vaccine development.
Bioinformatics
Data Science
Machine Learning
Research
President
@ NUS SoC Computing Club
September 2023 - September 2024
Singapore
- In my role as President, I have the privilege of leading a dedicated team of Vice Presidents and secretaries. Together, we are committed to charting a course towards new heights of excellence for the Computing Club.
- My tenure as President is driven by a three-fold mission:
  - Strengthening Internal Bonds
  - Establishing Partnerships
  - Building Relationships with Advisors
- Our Achievements:
  - Led a team of 30 in organising over 20 club events relating to student life and development, impacting 5,000 undergraduates
  - Allocated and managed club finances of up to $300,000 across 4 departments organising over 20 events
  - Led an exchange program policy change, potentially impacting 3,000 undergraduates
Management
Leadership
Project Management
Software Developer Intern
@ LFG
May 2023 - August 2023
Ho Chi Minh City, Vietnam
- Collaborated with 4 development team members to design and implement website and product features
- Wrote over 5,000 lines of clean, efficient, and maintainable code to develop the MVP
- Collaborated with 3 designers to ensure the user interface was responsive and user-friendly, keeping the design and development teams aligned
- Conducted business development and pitched the product to VCs at networking events
TypeScript
Prisma Studio
tRPC
React.js
Web Developer Lead
@ ASEAN Business Youth Association
June 2023 - September 2024
Singapore
- Directed a team of 3 website developers, 2 UI/UX designers and 2 copywriters from 3 different ASEAN countries in building a new full-stack web application
- Conducted code revisions and optimisation on a bi-weekly basis
- Wrote over 39,998 lines of code to implement user authentication and key features
TypeScript
React.js
Firebase
Project Management
Scrum