Summary
Overview
Work History
Education
Skills
Websites
Projects
Certification
Timeline
Generic
Sreejita Mazumder

Sreejita Mazumder

Nieuw-Vennep

Summary

Early career bioinformatics professional with hands-on experience in developing and validating machine learning models on health data. Strong foundation in study design, algorithm validation, and cross-functional collaboration. Proficient in Python, TensorFlow, and Scikit-learn, with a keen interest in applying AI and data analytics to healthcare challenges. Actively seeking a data scientist or data analyst role where I can contribute to innovative and safe clinical AI solutions through rigorous validation, stakeholder engagement, and a solid understanding of medical standards and regulations.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Bioinformatics Guest Employee

Amsterdam University Medical Center
09.2024 - Current
  • Applied and optimized XGBoost models to enhance clinical hypertension prediction, focusing on robust model validation and performance assessment by ~15%.
  • Collaborated with clinical researchers to ensure model outputs aligned with user needs and scientific rigor; currently contributing to manuscript drafting.

Bioinformatics Research Intern

Amsterdam University Medical Center
10.2023 - 08.2024
  • Designed and executed validation strategies for machine learning models predicting hypertension using structured longitudinal DNA methylation data (Illumina 850K EPIC array) from Ghanaian and Dutch cohorts.
  • Developed study protocols incorporating dimensionality reduction (PCA), statistical analysis (ICC), and RFECV-based feature selection to optimize classifiers (Random Forest, SVM, Logistic Regression), achieving a prediction accuracy of ~60%; ensured reproducibility via SLURM-based HPC workflows and GitLab version control.


Data Analyst

Tracxn Technologies
12.2019 - 05.2020

Analyzed large-scale datasets using Excel to identify industry trends; built dynamic Power BI dashboards and presented insights to cross-functional teams, supporting investment and strategic decisions.

Education

Master of Science - Bioinformatics

Vrije Universiteit Amsterdam & Universiteit Van Amsterdam
Amsterdam, Netherlands
08.2024

Bachelor of Technology - Biotechnology Engineering

Heritage Institute of Technology
Kolkata
08.2018

Skills

  • Programming & Analysis: Python, R, Bash, SQL(MySQL)
  • Version Control & Reproducibility: Git, GitLab
  • Cloud Platforms: Google Cloud Platform (Vertex AI)
  • Machine Learning: XGBoost, Scikit-learn, LightGBM
  • Deep Learning: TensorFlow, Keras, LSTM
  • Data Science & Validation: Exploratory Data Analysis (EDA), Feature Selection, Statistical Analysis, Model Validation, Time-Series Forecasting
  • Data Visualization: matplotlib, seaborn, ggplot2, Power BI
  • Research Methodologies: Study Design, Protocol Development, Cross-Validation Techniques, HPC Workflow Management (SLURM)
  • Languages: English (Native)







Projects

  • Time Series Forecasting with LSTM – Personal Project January 2025

 Designed and validated bidirectional LSTM models using TensorFlow to capture temporal dependencies in time-series data. Applied model evaludation techniques to improve prediction  accuracy, demonstrating robust validation and model assessment skills.


  • ML Engineer Recruitment Challenge – Draft Beer Image Classification February 2025

Developed and validated convolutional neural network (CNN) models using TensorFlow and the VGG16 architecture for image classification tasks. Demonstrated proficiency in handling unstructured image data, model evaluation, and delivering results within time-constrained environments.


  • Colorectal Cancer Classification – Vrije Universiteit Amsterdam February 2023 – March 2023

Developed and validated machine learning and deep learning models (Naive Bayes, SVM, MLP) on TCGA clinical data. Performed exploratory data analysis (EDA), feature engineering, and feature selection to improve model performance.


Breast Cancer Subtype Classification – Vrije Universiteit Amsterdam April 2022 – May 2022

Developed and validated machine learning classifiers (SVM, Logistic Regression, K-NN) to classify breast cancer subtypes using genomic array CGH data. Applied feature selection and dimensionality reduction techniques (UMAP) to improve model performance, demonstrating strong data analysis and model validation skills.




Certification

  • Data and AI Ethics Foundations (Udemy, April 2025)

Data Ethics, AI Ethics, Responsible AI, Data Privacy.


  • Statistics for Data Analysis using R (Udemy, April 2025)

Descriptive Statistics, Inferential Statistics, Probability Theory, Hypothesis Testing, Data Visualization, Regression Analysis, Data Analysis, Data Cleaning and Preprocessing.

Timeline

Bioinformatics Guest Employee

Amsterdam University Medical Center
09.2024 - Current

Bioinformatics Research Intern

Amsterdam University Medical Center
10.2023 - 08.2024

Data Analyst

Tracxn Technologies
12.2019 - 05.2020

Master of Science - Bioinformatics

Vrije Universiteit Amsterdam & Universiteit Van Amsterdam

Bachelor of Technology - Biotechnology Engineering

Heritage Institute of Technology
Sreejita Mazumder