Biostatistics - Data Scientist & Bioinformatics Enthusiast

Hi! I'm Fousseni SAMA

Explore my projects, read my blog, or get in touch.

👨‍💻

Overview

My journey in the field of data science began with a solid background in statistical engineering in Dakar, Senegal, where I acquired fundamental analytical skills at ENSAE. After gaining practical experience as a statistician and research assistant, I deepened my knowledge by applying statistics to biology and medicine through specialized training in biostatistics at Université Laval. Since 2022, I have been putting these multidisciplinary skills into practice as a data analyst, working on projects at the intersection of biostatistics, bioinformatics, and machine learning. I use statistical modeling, genomic data analysis, and modern machine learning techniques to help solve complex problems in healthcare and life sciences research.

Skills

Python: 90%
R: 95%
Machine learning: 100%
Genomics: 60%

Tools & Technologies

Python, R, TensorFlow, PyTorch, Pandas, Bioconductor, BLAST, Docker, SQL, Git

Education

Master's in Biostatistics with thesis

Université Laval, Québec, Canada

January 2022 - January 2025

Advanced statistical methods, statistical modeling, high-dimensional data analysis, scientific programming (R, Python, Bash)

Bachelor's in Statistics: Statistical Engineering

ENSAE, Dakar, Senegal

September 2016 - August 2020

Probability, mathematical statistics, impact evaluation methods, propensity score and matching analysis, longitudinal and survival data analysis, machine learning predictive models for prognostic factor identification, big data, econometrics

Experiences

Data Manager and Data Analyst
ROSEPH | Québec, Québec, Canada
Design and implementation of data collection and analysis pipelines for projects. Development of detailed statistical analysis plans (SAP), including missing data management. Statistical analysis of data and manipulation of large multi-source databases with R and Excel to ensure data integrity and quality. Creation of interactive dashboards (R Shiny) for communicating complex results to non-technical audiences. Development of automated reporting tools with R and Power Platform, reducing report generation time by nearly 40%. Close collaboration with multidisciplinary teams to define analytical needs and translate scientific questions into actionable statistical analyses.
Data Analyst
University | Québec, Québec, Canada
Cleaning, validation and structuring of textual data from articles. Manipulation and harmonization of large multi-source databases with SQL, Python, BigQuery and R to ensure data integrity and quality.
Data Analyst
University | Québec, Québec, Canada
Processing and statistical analysis of education data for identifying determinants of school performance. Development of an interactive R Shiny interface for visualization and exploration of results.
Chief Statistician
CRES | Dakar, Senegal
Design of stratified probability sampling plans for multi-site observational studies, ensuring representativeness of target populations. Development of standardized data collection protocols with electronic tools (CSPro), ensuring data quality and traceability. Supervision of collection, harmonization and structuring of multi-source databases from field surveys. Performance of bivariate and multivariate statistical analyses for identifying factors associated with outcomes of interest.
Statistician and Data Manager
ISRA | Dakar, Senegal
Cleaning, harmonization and structuring of multi-source research data for subsequent statistical analyses. Application of anomaly and outlier detection techniques to ensure data integrity before analysis. Standardization of data formats according to predefined protocols to facilitate integration into centralized databases.
Data Manager and Data Analyst
StatInfo | Dakar, Senegal
Design and implementation of field data collection systems with interviewer training. Development of sampling methods adapted to study objectives and operational constraints. Creation of data visualizations (graphs, tables) with R and Excel for communicating results to clients. Statistical processing of survey data with R and production of descriptive and inferential analyses.

Projects

Genome Analysis Tool

A comprehensive tool for analyzing genomic sequences with advanced ML algorithms.

Python, Bioinformatics

ML Genomics Pipeline

Machine learning pipeline for variant calling and disease prediction.

TensorFlow, Genomics

Data Visualization Dashboard

Interactive dashboard for exploring and visualizing genomic datasets.

JavaScript, React

Certifications

Genomic Data Science

Introduction to Genomic Technologies

Coursera

Course 1 of 6 · Complete

Python for Genomic Data Science

Coursera

Course 2 of 6 · Complete

Algorithms for DNA Sequencing

Coursera

Course 3 of 6 · Complete

Command Line Tools for Genomic Data Science

Coursera

Course 4 of 6 · Complete

Bioconductor for Genomic Data Science

Coursera

Course 5 of 6 · Complete

Statistics for Genomic Data Science

Coursera

Course 6 of 6 · Complete

Bioinformatics

Finding Hidden Messages in DNA (Bioinformatics I)

Coursera

Course 1 of 7 · 77% complete

Genome Sequencing (Bioinformatics II)

Coursera

Course 2 of 7 · 8% complete

Comparing Genes, Proteins, and Genomes (Bioinformatics III)

Coursera

Course 3 of 7 · Not started

Molecular Evolution (Bioinformatics IV)

Coursera

Course 4 of 7 · Not started

Genomic Data Science and Clustering (Bioinformatics V)

Coursera

Course 5 of 7 · Not started

Finding Mutations in DNA and Proteins (Bioinformatics VI)

Coursera

Course 6 of 7 · Not started

Bioinformatics Capstone: Big Data in Biology

Coursera

Course 7 of 7 · Not started

Get In Touch

Ready to transform your data into knowledge? Let's discuss how I can help you achieve your goals.

Portfolio

Exploring the intersection of data science and bioinformatics to unlock insights from complex biological data.

© 2025 Portfolio. All rights reserved.