Babacar GUEYE

Passionate Data & AI Engineer with a deep eager to be fully expert on Cloud, AI and automatisation.

Download Resume (PDF)

Babacar

Data Engineer

Competent about data acquisition, storage and transforming it into valuable information. Committed into building and maintaining intelligent systems. Experienced in data science, cloud computing, and full-stack development.

Professional Summary

Results-driven Data Engineer with 3+ years of experience in developing data pipelines and data-driven applications. Proven track record of building scalable ETL processes, optimizing data infrastructure, and delivering impactful insights that drive business growth. Expertise in Python, Apache Spark, AWS, and data engineering technologies. Motivated to staying current with emerging technologies and contributing to open-source projects.

Professional Experience

Freelance Web & AI Developer

Raktiak Studio (link)

09/2024 - 03/2025

France

  • Developed custom web applications using modern frameworks and Python backend technologies
  • Implemented AI solutions and machine learning models for client projects
  • Built scalable web services and APIs for various business requirements
  • Provided technical consulting and development services to multiple clients

Data Engineer & Backend Developer

TrouveTout (Startup)

07/2023 - 05/2024

France

  • Developed ETL processes using Python and Airflow for marketplace data processing
  • Built scalable backend with Django for data ingestion, transformation, and storage
  • Implemented automated CI/CD pipelines with Docker, Jenkins, and GitLab, reducing deployment time by 30%
  • Created data quality monitoring and anomaly detection systems to ensure data integrity

Data Science/Engineer Intern

Orange (Telecom Sector)

11/2022 - 05/2023

France

  • Developed data lake architecture using Hadoop and Spark to handle over 200 TB of telecom data
  • Deployed Kafka for real-time streaming of customer data, enabling better churn analysis
  • Assisted in migrating on-premises data infrastructure to AWS, utilizing S3, Glue, and Redshift
  • Supported data science team by providing scalable data pipelines for churn prediction using Apache Spark

Data Analyst Intern

Laboratoire de Mathématiques Nicolas ORESME

03/2021 - 07/2021

France

  • Optimized data pipelines for research projects, reducing processing time by 20%
  • Developed interactive dashboards with SAS Visual Analytics for better variable relationship understanding
  • Conducted statistical analysis and data transformation for research initiatives

Education

DU Big Data, Data Science et Analyse des risques

Université de Montpellier

Montpellier, France10/2023 - 09/2024

GPA:

Big DataData ScienceRisk AnalysisStatistical Modeling

Master 2 Statistiques Appliquées et Analyse Décisionnelle

Université de Caen

Caen, France09/2021 - 09/2023

GPA:

Applied StatisticsDecision AnalysisStatistical ModelingData Analysis

Licence 3 Mathématiques Appliquées et Licence 3 Informatique

Université de Caen

Caen, France09/2019 - 09/2021

GPA:

Applied MathematicsComputer ScienceAlgorithmsProgramming

Technical Skills

Data Engineering

Apache KafkaHadoopSparkAirflowAWS GlueDatabricks

ETL & Data Pipeline

Python (Pandas, PySpark)SQLAirflowTalend

Cloud & DevOps

AWS (S3, EC2, Lambda)GCPDockerKubernetesTerraformAnsible

Big Data Tools

Apache HadoopHiveSparkHDFS

Database Management

SQL/NoSQLPostgreSQLMongoDBCassandra

Data Analytics & Modeling

Dimensional ModelingOLAPData Warehousing

Version Control & CI/CD

GitJenkinsGitLab CI

Project Management

Agile/ScrumJiraTrelloSlack

Certifications

AWS Certified Solutions Architect

Amazon Web Services

2023

Apache Spark Developer Certification

Databricks

2023

Docker Certified Associate

Docker Inc

2022

Kubernetes Administrator

Cloud Native Computing Foundation

2022

Interested in working together?

Let's discuss how I can contribute to your next project

Get In Touch